Fig. 1A 
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10 20 30 40 50 60 

****** 

BTHKURHD ATGGATAACMTCCGMCATCMTGAATGCATTCCTTATMTTGTTTMGTMCCCTGAA 

flsvnbt.fin ...~.C C..C C..G C..C..C..C..CC.G..C C..G 

bssyn C C. .C C. .G C. .C. .C. .C. .CC.G. .C C. .G 

70 80 90 100 110 120 

****** 

BTHKURHD GTAGAAGTATTAGGTGGAGAAAGAATAGAAACTGGTTACACCCCAATCGATATTTCCTTG 

flsvnbt.fin . .G. .G. .GC.G. ,C. .C. .GC.C. .C. .G. .C. .C C C. .CAG.C . 

bssyn . .G. .G. .GC.G. .C. .C. .GC.C. .C. .G. .C. .C C C. CAG.C. 

130 140 150 ' 160 170 180 

****** 

BTHKURHD TCGCTAACGCAATTTCTTTTGAGTGAATTTGTTCCCGGTGCTGGATTTGTGTTAGGACTA 

flsynbt.fin AGCCC.G. .CGC . .C. .G. .C.G CC .C. .C. . .C.G. .CG 

bssyn AGC. .G. .C. .G. .C. .GC. . . .C. .G. .C. .G C. .C. .C. .C. . .C.G. .C. .G 

190 200 210 220 230 240 

****** 
BTHKURHD GTTGATATAATATGGGGAATTTTTGGTCCCTCTCAATGGGACGCATTTCTTGTACAAATT 

flsynbt.fin ..G..CC..C CC. .CC . .AGC .G CC.CG..CC 

bssyn . .G. .C. .C. .C C..C..C..C... AGC . . G C. .C. .G. .G. .G. .C 

250 260 270 280 290 300 

BTHKURHD GMCAGTTAATTAACCMAGMTAGMGMTTCGCTAGGMCCAAGCCATTTCTAGATTA 

^ n n bt - fin ::e:::g:B::g:::::B8:g::8::8::e:::::§g:8:::::i:::::gte88:8g:B 

310 320 330 340 350 360 

* * * * * * 

BTHKURHD GMGGACTMGCMTCTTTATCAMTTTACGCAGAATCTTTTAGAGAGTGGGAAGCAGAT 

flsynbt.fin ..G..C..G C..G..C C CGAGCCCC G..C..C 

bssyn ..G..C..G C..G..C.:...C C. .GAGC. .CC.C G..C..C 

370 380 390 400 410 420 

BTHKURHD CCTACTAATCCAGCATTAAGAGAAGAGATGCGTATTCAATTCAATGACATGAACAGTGCC 

flsynbt.fin . .CCCCCC.GC.C.G C. .C..G C C... 

bssyn ..C..C..C..C..CC.GC.C..G CC.G C C. . . 

430 440 450 460 470 480 

^ "k ^ * « * 

BTHKURHD CTTACMCCGCTATTCCTCTTTTTGCAGTTCAMATTATCAAGTTCCTCTTTTATCA^ 

SSS bt,fin :±±:::±±±±±±±±±±±'&'^^'^ 

490 500 510 520 530 540 

BTHKURHD TATGTTCMGCTGCAMTTTACATTTATCAGTTTTGAGAGATGTTTCAGTGTTTGGACAA 

^!! bt - fin :±±±±±^±^^:$:5±±:^::::±±^ 

550 560 570 580 590 600 

BTHKURHD AGGTGGGGATTTGATGCCGCGACTATCMTAGTCGTTATMTGATTTMCTAGGCTTATT 

flsynbt.fin CC C..C..C C..C C. .C. .C. .C. .C. .CC.G. .CC.C .G. .C 

bssyn CC CC.C CC CC.C..C.C..CC.G..CCC.G..C 
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BTHKURHD 
f lsynbt. fin 
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BTHKURHD 
f lsynbt. fin 
bssyn 



BTHKURHD 
f lsynbt. fin 
bssyn 



BTHKURHD , 
f lsynbt . fin 
bssyn 



BTHKURHD 
f lsynbt. fin 
bssyn 



BTHKURHD 
f lsynbt. fin 
bssyn 



GGCAACTATACAGATCATGCTGTACGCTGGTACAATACGGGATTAGAGCGTGTATGGGGA 
• . * . . • * ♦ C • .C. • C ♦ .C. .C« »G» . . • • • ♦ . . . .C. • C • . CC .G. • • • • C • .G. . > ■ »T 
C..C..C..C..C..G C. .C. .CC.G C. .G T 



670 
* 



680 



690 
* 



700 
* 



710 
* 



720 
* 



CCGGATTCTAGAGATTGGATAAGATATAATCMTTTAGMGAGMTTMCACTAACTGTA 

. .C. .CAGCC.C. .C C. .G. .CC. .G. .CC.CC.C. .GC.G. .C. .G. .C..G 

..C..CAGCC.C..C C..G..C..C..G. .CC.CC.C. .GC.G. .C. .G. .C. .G 



770 
* 



780 
* 



730 740 750 760 

* * * * 

TTAGATATCGTTTCTCTATTTCCGAACTATGATAGTAGAACGTATCCAATTCGAACAGTT 

C.G..C GAGC..G..C..C C. .C. .CC.C. .C. .C. .C .C. .C. .C. .G 

C.G..C GAGC..G..C..C C. .C. .CC.C. .C. .C. .C. .C. .C. .C. .G 



830 
* 



840 
* 



790 800 810 820 

TCCCAATTAACAAGAGAAATTTATACAMCCCAGTATTAGMAATTTTGATGGTAGTTTT 

AG...GC.G..CC.C..G C..C CGC.CCCCCCCC 

AG... GC.G. .CC.C. .G CC C. .GC.G. .G. .C. .C. .C. .C. .C. .C 



890 
* 



900 



850 860 870 880 

CGAGGCTCGGCTCAGGGCATAGAAGGAAGTATTAGGAGTCCACATTTGATGGATATACTT 

. .C. . .AGC. .C C..G..C..C..CC.C..C..C..CC C. .C. .G 

"c.AGC.C C..G..C..C..CC.C..C..C..CC CC.G 



910 



920 



930 



940 



950 
* 



960 



AACAGTATAACCATCTATACGGATGCTCATAGAGGAGAATATTATTGGTCAGGGCATCAA 
C C C. . CC.C CC. CC.G. .CC .AGC. .CC.G 

:::::c::c:::::::c.c.cccccccc.c.AGccc.G 



970 



980 



990 



1000 

* 



1010 
* 



1020 

* 



ATAATGGCTTCTCCTGTAGGGTTTTCGGGGCCAGAATTCACTTTTCCGCTATATGGAACT 

. .C CAGC. .CC.C .CAGC .CC.G CCC.CC.C .C 

::c:::..CAGC..CCCCAGCCC.G CCCCCCC 



1070 



1080 
* 



1030 1040 1050 1060 

ATGGGAMTGCAGCTCCACAACAACGTATTGTTGCTCAACTAGGTCAGGGCGTGTATAGA 

C.C..T..A..T..G..G..C.C..G..A..G..G..C A CC.C 

C..C..T..A..T..G..G..C..C..G..A..G..G..C A CC.C 



1130 
* 



1140 



1090 1100 1110 1120 

ACATTATCGTCCACTTTATATAGAAGACCTTTTAATATAGGGATMATAATCAACAACTA 

..CCGAGCAC.CCCCC.TC C. .C. .C. .C. .C. .C. .C. .G..G..G 

. .CCGAGCAC .CC.G. .CC.TC CCCCCCCG. .CG 



1190 
* 



1200 
* 



1150 1160 1170 1180 

TCTGTTCTTGACGGGACAGAATTTGCTTATGGAACCTCCTCAAATTTGCCATCCGCTGTA 

AGC .G..G C..CG..CCCC. .AG. AGC. .CC CAG. . .L. .b 

AGCCC:::C..C..G..C..C..C..C...AG.AGC..CC....CAG...C..G 



Fig. 1C 
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BTHKURHD 

flsynbt.fin 

bssyn 



BTHKURHD 

flsynbt.fin 

bssyn 



BTHKURHD 
flsynbt . fin 
bssyn 



TACAGAAAAAGCGGAACGGTAGATTCGCTGGATGAAATACCGCCACAGAATAACAACGTG 

« . • C . C • • G . • • * . C . . C . • G • • CAGC C«*Gi*C*iCi*T» a ««»C 

...C.C.G C..C..G. .CAGC C..G..C..C..T C 



1270 



1280 



1290 
* 



1300 
* 



1310 
* 



1320 
* 



CCACCTAGGCAAGGATTTAGTCATCGATTAAGCCATGTTTCMTGTTTCGTTCAGGCTTT 

C.A. .G. .C. .C. .C. .C..TC.G C..GAGC C. .CAGT C 

C.A. .G. .C. .C. .C. .C..TC.G C..GAGC C. .CAGT C 



1360 
* 



1370 
* 



1380 



1330 1340 1350 - 

* * * 

AGTAATAGTAGTGTAAGTATAATAAGAGCTCCTATGTTCTCTTGGATACATCGTAGTGCT 

. .C..C..C..C..G..C..C..CC.T..A AGC C 

..C..C..C..C..G..C..C..CC.T..A AGC T..C..C C 



1390 1400 1410 1420 1430 

* * * * * 

BTHKURHD GAATTTAATMTATAATTCCTTCATCA — CAMTTACACAMTACCTTTAACAAAATCTA 
flsynbt.fin . .G. .C. .C. .C. .C. .C. .— . .G. .GC. .G. .C. .C. .G. .C. . CC.C. C. GAGC 
bssyn . .G. .C. .C. .C. .C. .C. .— . .G. .GC. .G. .C. .C. .G. .C. .CC.G. .C. .GAGC. 

1440 1450 1460 1470 1480 

BTHKURHD CTAATCTTGGCTCTGGAACTTCTGTCGTTAAAGGACCAGGATTTACAGGAGGAGATATTC 
flsynbt.fin C. X. X. . .AGC. X. .CAGC. X. X. X. X. X. X. X. X. X. X. X. X. 



1490 
* 



bssyn 



1500 



1510 



1520 



1530 



1540 

* 



1550 

* 



TTCGAAGMCTTCACCTGGCCAGATTTCAACCTTAAGAGTAAATATTACTGCACCATTAT 

ffsynbtTfin .G. .CC.C. .CAGC. .C CAGC. . .C.GC.C. .G. .C. .C. .C. .C . .CC.GA 

■ - .G. .CC.C. .CAGC. .C : . .CAGC. . .C.GC.C. .G. .C. .C. .C. .C. .CC.GA 



BTHKURHD 
flsyn. 
bssyn 



1600 



1610 
* 



1560 1570 1580 1590 

BTHKURHD CACAMGATATCGGGTMGAATTCGCTACGCTTCTACCACAMTTTACAATTCCATACM 

flsynbt.fin GC. .GC.C. .C. .C. .CC.C. .C CAGC C. .CC.G. .G C. .CA 

bssyn GC. .GC.C. .C. .C. .CC.C. .C 



;cagc!!!!!c!!cc.g..g c.ca 



1660 

* 



1670 



1620 1630 1640 1650 

BTHKURHD CAATTGACGGAAGACCTATTAATCAGGGGAATTTTTCAGCAACTATGAGTAGTGGGAGTO 

flsynbt.fin GC.C CC.C. .C. .C. .C C. .C. .CAGC. .C. .C C..C..C..L. 

bssyn GC.C CC.C. .CC.C C.C .CAGC. .C.C C..L..C.C 



1720 
* 



1730 
* 



BTHKURHD 

flsynbt.fin 

bssyn 



1680 1690 1700 1710 

ATTTACAGTCCGGMGCTTTAGGACTGTAGGTTTTACTACTCCGTTTMCTTTTCAMTG 

.CC.G... AG... C CC.C. C.G.. C.C. C.C. C.C CAGC .C 

.CC.G. . .AG. . .C CC.C .C .G. .C .C. .C. .C .C .C CAGC. .C. 



1780 
* 



1790 
* 



BTHKURHD 
flsynbt. fin 
bssyn 



1740 1750 1760 1770 

GATCAAGTGTATTTACGTTMGTGCTCATGTCTTCAATTCAGGCAATGAAGTTTATATAG 

. CAGC. C. C.C. CC.C. C.C .C. .G CAGC C..G..C.C..C. 

.CAGC. .C. .G. .C. .CC.G. .C. .C .C .G CAGC C .G. .G. .C .C 



Fig. ID 



1800 1810 1820 1830 1840 1850 

****** 

BTHKURHD ATCGAATTGAATTTGTTCCGGCAGAAGTAACCTTTGAGGCAGMTATGATTTAGAAAGAG 

flsynbt.fin .C. .C..C. .G..C. .G. .C..C..G. .G C C. .G..C..CC.G. .G..G. 

bssyn .C. .C. .C. .G. .C. .G. .C. .C. .G. .G C C. .G. .C. .CC.G. .G. .G. 

1860 1870 1880 1890 1900 1910 

****** 

BTHKURHD CACAAAAGGCGGTGAATGAGCTGTTTACTTCTTCCAATCAAATCGGGTTAAAAACAGATG 

flsynbt.fin .T. .G C C C. .CAGCAG. . .C. .G CC.G. .G. .C..C. 

bssyn .T. .G C C C. .CAGCAG. . .C. .G CC.G. .G. .C. .C. 

1920 1930 1940 1950 1960 1970 

BTHKURHD TGACGGATTATCATATTGATCAAGTATCCAATTTAGTTGAGTGTTTATCTGATGAATTTT 

flsynbt.fin . . . .C. .C. .C. .C. .C G. .GAG. . .CC.G. .G... . .CC.GAGC. .C. .G. .C. 

bssyn . . . ,C. .C. .C. .C. .C. . . . 

1980 1990 2000 2010 2020 2030 

BTHKURHD GTCTGGATGAAAAAAAAGAATTGTCCGAGAAAGTCAAACATGCGAAGCGACTTAGTGATG 
flsynbt.fin .C C. .G. .G. .G. .GC. .AG G. .G. ; G. .C. 'C- . . .C. -G-C. .C. 

bssyn 

2040 2050 2060 2070 2080 2090 
BTHKURHD AGCGGAATTTACTTCAAGATCCAAACTTTAGAGGGATCAATAGACAACTAGACCGTGGCT 

flsynbt.fin C. .CC.G. .G. .G. .C. .C iSi-L-L'-L' 1-11111-1111 

bssyn 

2100 2110 2120 2130 2140 2150 

BTHKURHD GGAGAGGAAGTACGGATATTACCATCCAAGGAGGCGATGACGTATTCAAAGAGAATTACG 

f lsynbt .fin . . C . C . . C . . C . . C . . C . . C :?:i9lllll ' 1 ' ' ' 11 ' 11-11111-1111 

bssyn " 

2160 2170 2180 2190 2200 2210 

BTHKURHD TTACGCTATTGGGTACCTTTGATGAGTGCTATCCAACGTATTTATATCAAAAMTAGATG 

f lsynbt .fin . G . . C . . GC C C . . C X . X^C . XC^G . L c -^±i±i±i 

bssyn 

2220 2230 2240 2250 2260 2270 

BTHKURHD AGTCGMATTAAAAGCCTATACCCGTTACCAATTMGAGGGTATATCGAAGATAGTCAAG 
f lsynbt .fin . . AGC . . GC . G . . G C . . . . . C ::::: GC : GC : C . . C . . C . ii-*i±-_±'_±L 

bssyn 

2280 2290 2300 2310 2320 2330 
BTHKURHD ACTTAGAMTCTATTTMTTCGCTACAATGCCAAACACGAMCAGTAAATGTGCCAGGTA 
flsynbt.fin . .C.G. .G CC.G. .C. . . . . . . .C. • . . . .G. .C.^.^^.^.^. 

bssyn 

2340 2350 2360 2370 2380 2390 
BTHKURHD CGGGTTCCTTATGGCCGCTTTCAGCCCCMGTCCMTCGGAAAATGTGCCCATCATTCCC 

f lsynbt .fin . C . . CAG . C . G C . . GAGC .... : 9 :: 9:: C :: • :i C ll G l ' -11111-11---11 

bssyn " 



Fig. IE 



2400 2410 2420 2430 2440 2450 

****** 

BTHKURHD ATCATTTCTCCTTGGACATTGATGTTGGATGTACAGACTTAMTGAGGACTTAGGTGTAT 
flsynbt.fin .C. .C. ..AG.C C. .C. .G. .C. .C. .C. . .C.G. .C C.G. .C..G. 

bssyn 

2460 2470 2480 2490 2500 2510 

****** 

BTHKURHD GGGTGATATTCAAGATTAAGACGCAAGATGGCCATGCAAGACTAGGAAATCTAGAATTTC 
flsynbt.fin C C C..G..C C. .CC.C. .G. .C. .C. .G. .G. .C. 

bssyn 

2520 2530 2540 2550 2560 2570 

****** 

BTHKURHD TCGAAGAGAAACCATTAGTAGGAGAAGCACTAGCTCGTGTGAAAAGAGCGGAGAAAAAAT 

flsynbt.fin .G. .G G. .CC.G. .G. .C.G. .C. .G. .C. .C GC.C. : C :::: - G :: G - 

bssyn " 

2580 2590 2600 2610 2620 2630 
****** 

BTHKURHD GGAGAGACAAACGTGAAAAATTGGAATGGGAAACAAATATTGTTTATAAAGAGGCAAAAG 

flsynbt.fin . .C.C G. .C. .G..GC G G. .C..C. - C..G. .C. .G. . •-C._.G. 

bssyn 

2640 2650 2660 2670 2680 2690 
****** 
BTHKURHD MTCTGTAGATGCTTTATTTGTAAACTCTCAATATGATAGATTACAAGCGGATACCAACA 
flsynbt.fin .GAGC. .G. .C. .CC.G. .C. .G. . -AGC. .G^.C^.C^CC.G. -^.C.-C^.^. 

bssyn ~ 

2700 2710 2720 2730 2740 2750 

BTHKURHD TCGCGATGATTCATGCGGCAGATAAACGCGTTCATAGCATTCGAGAAGCTTATCTGCCTG 
flsynbt.fin ....C C. .C. .C. .C. .C. .G G. .C. . . . . ...C. .G. .C. .C. . . . .C. 

bssyn 1 

2760 2770 2780 2790 2800 2810 

BTHKURHD AGCTGTCTGTGATTCCGGGTGTCMTGCGGCTATTTTTGAAGAATTAGAAGGGCGTATTT 
flsynbt.fin AGC C..C..C..G..C..C. -C - - C.C. .G :1 GC.G :: G. .C^.C. 

bssyn 

2820 2830 2840 2850 2860 2870 

BTHKURHD TCACTGCATTCTCCCTATATGATGCGAGAAATGTCATTAAAAATGGTGATTTTAATAATG 

flsynbt.fin C. .C. . .AG. . .G. .C. .C. .CC.C. .C. .G^C. •9nr„ *1 _1_ -11 

bssyn 

2880 2890 2900 2910 2920 2930 

BTHKURHD GCTTATCCTGCTGGAACGTGAAAGGGCATGTAGATGTAGAAGAACAAAACAACCACCGTT 
flsynbt.fin . . C . GAG G..C. .C. .G. .C. .G. .G : .G..G : CA 

bssyn 

2940 2950 2960 2970 2980 2990 

BTHKURHD CGGTCCTTGTTGTTCCGGAATGGGAAGCAGAAGTGTCACAAGAAGTTCGTGTCTGTCCGG 
flsynbt.fin GC. .G. .G. .G. .G. .C. .G G. «C. .G^-.^GC^G^.G. .G_. .C. .G. .C ; .C 

bssyn 



Fig. IF 



3000 3010 3020 3030 3040 3050 
BTHKURHD GTCGTGGCTATATCCTTCGTGTCACAGCGTACAAGGAGGGATATGGAGAAGGTTGCGTAA 

flsynbt.fin .C. .C C G. .C. .G. .C. .C. milEl * El * Ell G lI C ' 11 ' ' G l 

bssyn 

3060 3070 3080 3090 3100 3110 
BTHKURHD CCATTCATGAGATCGAGAACAATACAGACGAACTGAAGTTTAGCAACTGTGTAGAAGAGG 

flsynbt.fin C. .C C ll C l G ' _ '111 11 "1111-11-11-1111 

bssyn 

3120 3130 3140 3150 3160 3170 

BTHKURHD AAGTATATCCAMCMCACGGTMC^ 

flsynbt.fin .G. .G. .C. .C n'EllE'lEll 11 11_11_11_11-11-11-11-------- 

bssyn 

3180 3190 3200 3210 3220 3230 

BTHKURHD AGGGTACGTACACTTCTCGTMTCGAGGATATGAC 

f lsynbt .fin . . . . C . . C CA ^llEll C ' * C * ' C llElllllElllll9ll-llllli------ 

bssyn 

3240 3250 3260; 3270 3280 3290 

BTHKURHD TACCAGCTGATTATGCATCAGCCTATGAAGAAW^GCATATACAGATGGACGAAGAGACA 
f lsynbt .fin . G . . C . . C . . C . . C . : CAGC . . . C . L G L ^ L ^i±i±i±L±L±L 1__1_1111 
bssyn 

3300 3310 3320 3330 3340 3350 

BTHKURHD ATCCTTGTGAATCTAACAGAGGATATGGGGATTACACACCACTACCAGCTGGCTATGTGA 
flsynbt.fin .C. .C. .C. .GAGC. . .C.C. .C. .C. .C. .C. . . . .C. -C..G..C..C. . ...C.... 

bssyn 

3360 3370 3380 3390 3400 3410 

BTHKURHD CAAAAGAATTAGAGTACTTCCCAGAAACCGATAAGGTATGGATTGAGATCGGAGAAACGG 

flsynbt.fin .C. .G. .GC.G :i' •El , ? , lll'_ l_llllllllllllll-ll-H-- 

bssyn 

3420 3430 3440 3450 3460 

BTHKURHD AAGGAACATTCATCGTGGACAGCGTGGAATTACTTCTTATGGAGGAATAA 

flsynbt.fin .G. .C.C GC.G..G..G .G..G 

bssyn ...TG..G 



Fig. 2A 



BTHKURHD 
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BTHKURHD 
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BTHKURHD 
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BTHKURHD 
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BTHKURHD 
bssyn 



BTHKURHD 
bssyn 



10 
* 



20 



30 



40 
* 



50 



60 



ATGGATAACAATCCGAACATCAATGAATGCATTCCTTATAATTGTTTAAGTAACCCTGAA 
C C. .C C. .G C. .C. .C. .C. .CC.G. .C C. .G 



70 

* 



80 

* 



90 

* 



100 



110 

* 



120 
* 



GTAGAAGTATTAGGTGGAGAMGMTAGAAACTGGTTACACCCCMTCGATATTTCCTTG 
..G..G..GC.G..C..C..GC.C..C..G..C..C C C. .CAG.C. . 

130 140 150 160 170 180 

****** 

TCGCTAACGCAATTTCTTTTGAGTGAATTTGTTCCCGGTGCTGGATTTGTGTTAGGACTA 
AGC. .G. .C. .G. .C. .GC C. .G. .C. .G C. .C. .C. .C. . .C.G. .C. .G 



190 



200 



210 



220 



230 



240 
* 



GTTGATATAATATGGGGAATTTTTGGTCCCTCTCAATGGGACGCATTTCTTGTACAAATT 
. .G. .C. .C..C C. .C. .C. .C. . .AGC. .G C. .C.G. .G. .G. .C 



250 
* 



260 
* 



270 
* 



280 
* 



290 
* 



300 
* 



GAACAGTTAATTAACCAAAGAATAGAAGAATTCGCTAGGAACCAAGCCATTTCTAGATTA 
. .G. . .C.G. .C GC.C. .C .G. .G CCC G CAGCC.CC.G 



310 



320 330 
* * * 



340 
* 



350 
* 



360 
* 



GAAGGACTAAGCAATCTTTATCAAATTTACGCAGAATCTTTTAGAGAGTGGGAAGCAGAT 
..G..C..G C.G..C C C. .GAGC. .CCC G. .C. .C 



420 
* 



370 380 390 400 410 

***** 

CCTACTAATCCAGCATTAAGAGAAGAGATGCGTATTCAATTCAATGACATGAACAGTGCC 

. .C .C. .C. .C. .CC.GC.C .G C. .C.G C C. .. 



430 
* 



440 
* 



450 
* 



460 
* 



470 
* 



480 
* 



CTTACMCCGCTATTCCTCTTTTTGCAGTTCAAAATTATCAAGTTCCTCTTTTATCAGTA 
..G..C C..C.C.G..C.C.G..G..C..C..G..G..C. .GC.GAGC .G 



540 
* 



490 500 510 520 530 
***** 
TATGTTCAAGCTGCAAATTTACATTTATCAGTTTTGAGAGATGTTTCAGTGTTTGGACAA 
. .C. .G. .G..C .C. .CC.G. .CCGAGC. .GC..C.C..C. .CAGC C .C. .G 



590 
* 



600 
* 



550 560 570 580 

AGGTGGGGATTTGATGCCGCGACTATCMTAGTCGTTATMTGATTTMCTAGGCTTATT 
C.C CCC CC CC .C.C.C.CC.G..CC.C. .G..C 



660 
* 



610 620 630 640 650 

***** 

GGCAACTATACAGATCATGCTGTACGCTGGTACAATACGGGATTAGAGCGTGTATGGGGA 

C CCC. C.G C.C CC.G CG i 



710 
* 



720 
* 



670 680 690 700 

* * * * 

CCGGATTCTAGAGATTGGATMGATATMTCAATTTAGAAGAGMTTMCACTMCTGTA 

..CCAGCCCC CG. . CC.G. .CC CCC GCG.. C.G. .C.G 
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760 
* 



770 
* 



780 
* 



730 740 750 

* * * 

TTAGATATCGTTTCTCTATTTCCGMCTATGATAGTAGAACGTATCCMTTCGAACAGTT 
C.G. .C GAGC..G. .C. .C C. .C. .CC.C. .C. .C. .C. .C. .C. .C..G 



790 
* 



800 



810 
* 



820 



830 



840 
* 



TCCCAATTAACAAGAGAAATTTATACAAACCCAGTATTAGAAAATTTTGATGGTAGTTTT 
AG. . .GC.G. .CC.C. .G C. .C. . .'; .C. .GC.G. .G. .C. .C. .C. .C. .C. .C 



850 



870 

* 



880 



890 



900 



860 

****** 
CGAGGCTCGGCTCAGGGCATAGAAGGAAGTATTAGGAGTCCACATTTGATGGATATACTT 
..C...AGC..C C..G..C..C..CC.C..C..C..CC C..C..G 

910 920 930 940 950 960 

****** 

MCAGTATAACCATCTATACGGATGCTCATAGAGGAGMTATTATTGGTCAGGGCATCAA 

. . .C. .C C. .C. .C. .C. .CC.C. .C. .G. .C. .C. . .AGC. .C. .C. .G 



970 
* 



980 
* 



990 
* 



1000 
* 



1010 
* 



1020 
* 



ATAATGGCTTCTCCTGTAGGGTTTTCGGGGCCAGAATTCACTTTTCCGCTATATGGAACT 
..C CAGC..C..C..C..CAGC..C..C..G C. .C. .C. .G. .C. .C. .C 



1030 



1040 



1050 
* 



1060 
* 



1070 
* 



1080 
* 



ATGGGAAATGCAGCTCCACAACAACGTATTGTTGCTCAACTAGGTCAGGGCGTGTATAGA 
C. .C. .T. .A. .T. .G. .G. .C. .C. .G. .A. .G. .G. .C A CC.C 



1120 
* 



1130 
* 



1140 
* 



1090 1100 1110 
ACATTATCGTCCACTTTATATAGAAGACCTTTTAATATAGGGATAAATAATCAACAACTA 
. . CC . GAGCAG . . .CC.G. .CC.TC C. .C. .C. .C. .C. .C. .C. .G. .G. .G 



1190 
* 



1200 
* 



1150 1160 1170 1180 

TCTGTTCTTGACGGGACAGMTTTGCTTATGGMCCTCCTCAMTTTGCCATCCGCTGTA 
AGC..G..G C..C..G..C..C..C..C. .AG. AGC. .CC CAG ...C.G 

1210 1220 1230 1240 1250 1260 

TACAGAAAAAGCGGAACGGTAGATTCGCTGGATGAAATACCGCCACAGAATAACAACGTG 
. . .CC. .G C. .C .G. .CAGC C .G. .C. .C. .T C 



1270 
* 



1280 
* 



1290 
* 



1300 
* 



1310 
* 



1320 
* 



CCACCTAGGCMGGATTTAGTCATCGATTAAGCCATGTTTCMTGTTTCGTTCAGGCTTT 
CA..G..C.C.C..C..TCG C. .GAGC C .CAG1 t 



1370 
* 



1380 
* 



1330 1340 1350 1360 
AGTAATAGTAGTGTAAGTATMTAAGAGCTCCTATGTTCTCTTGGATACATCGTAGTGCT 
. .C. .C. .C .C. .G. .C .C. .CCT. .A AGC T. .C .C C 



1420 
* 



1430 
* 



1390 1400 1410 

GAATTTAATAATATAATTCCTTCATCA — CAAATTACACAMTACCTTTAACMAATCTA 
..G..C.C.C.C.C.~..G..GC.G..C.C.G..C.CCG..C .GAOL. 
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1440 



1450 
* 



1460 



1470 



1480 
* 



1490 
* 



CTAATCTTGGCTCTGGAACTTCTGTCGTTAAAGGACCAGGATTTACAGGAGGAGATATTC 
.C. .C. .G. . . AGC. .C. .CAGC. .G. .G. .G. .C. .C. .C. .C. .C. .C. .C. .CC. 

1500 1510 1520 1530 1540 1550 

****** 

TTCGAAGAACTTCACCTGGCCAGATTTCAACCTTAAGAGTAAATATTACTGCACCATTAT 

. G . . CC . C . . CAGC . . C CAGC...C.GC.C..G..C..C..C..C..CC.GA 



1600 
* 



1610 
* 



1560 1570 1580 1590 

CACAAAGATATCGGGTAAGMTTCGCTACGCTTCTACCACMATTTACAATTCCATACAT 
GC. .GC.C. .C. .C. .CC.C. .C CAGC C. .CC.G. .G C. .CA 



1620 1630 1640 1650 1660 1670 

* * * * 



CAATTGACGGAAGACCTATTAATCAGGGGAATTTTTCAGCAACTATGAGTAGTGGGAGTA 
GC. .C CC.C. .C. .C. .C C. .C. .CAGC. .C. .C C. ,C. .CC. 

1680 1690 1700 1710 

ATTTACAGTCCGGMGCTTTAGGACTGTAGGTTTTACTACTCCGTTTMCTTTTCAAATG 
.CC.G. . .AG. . .C CC.C. .CG. .CCCCCC CAGCC. 



1720 
* 



1730 
* 



1780 
* 



1790 
* 



1740 1750 1760 1770 

GATCAAGTGTATTTACGTTAAGTGCTCATGTCTTCAATTCAGGCAATGAAGTTTATATAG 
.CAGC. .CCCCC.CCCCG CAGC CG..CCC. 



1840 
* 



1850 



1800 1810 1820 1830 

ATCGMTTGMTTTGTTCCGGCAGMGTMCCTTTGAGGCAGAATATGATTTAGAAAGAG 
.CCCCCG. .CC.G. .G C CCCCC.CCG. 



1900 
* 



1910 
* 



1860 1870 1880 1890 

CACAMAGGCGGTGAATGAGCTGTTTACTTCTTCCAATCAAATCGGGTTAAAAACAGATG 
, T . . G C C C CAGCAG ...C..G CC . b . . b . . o . . L . 



1920 



1930 



1940 
* 



TGACGGATTATCATATTGATCAAGTATCC 
CCCCC CG.AG 
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10 20 30 40 50 60 

****** 

synlT . mze ATGGACAACAACCCCAACATCMCGAGTGCATCCCCTACAACTGCCTGAGCMCCCCGAG 

bssyn 

synrul.mod 

70 80 90 100 110 120 

****** 

synlT . mze GTGGAGGTGCTGGGCGGCGAGCGCATCGAGACCGGCTACACCCCCATCGACATCAGCCTG 

bssyn . . . • 

synrul.mod 

130 140 150 - 160 170 180 

****** 

synlT . mze AGCCTGACCCAGTTCCTGCTGAGCGAGTTCGTGCCCGGCGCCGGCTTCGTGCTGGGCCTG 

bssyn 

synrul.mod 

190 200 210 220 230 240 

****** 

synlT . mze GTGGACATCATCTGGGGCATCTTCGGCCCCAGCCAGTGGGACGCCTTCCTGGTGCAGATC 

bssyn 

synrul.mod • 

250 260 270 280 290 300 

****** 

synlT . mze GAGCAGCTGATCAACCAGCGCATCGAGGAGTTCGCCCGCAACCAGGCCATCAGCCGCCTG 

bssyn 

synrul.mod 

310 320 330 340 350 360 

****** 



synlT . mze GAGGGCCTGAGCAACCTGTACCAGATCTACGCCGAGAGCTTCCGCGAGTGGGAGGCCGAC 

bssyn A 

synrul.mod <A 

370 380 390 400 410 420 

****** 

synlT.mze CCCACCAACCCCGCCCTGCGCGAGGAGATGCGCATCCAGTTCAACGACATGAACAGCGCC 

bssyn 

synrul.mod 

430 440 450 460 470 480 

synlT . mze CTGACCACCGCCATCCCCCTGTTCGCCGTGCAGAACTACCAGGTGCCCCTGCTGAGCGTG 

bssyn 

synrul.mod 

490 500 510 520 530 540 

* * * * 

synlT.mze TACGTGCAGGCCGCCMCCTGCACCTGAGCGTGCTGCGCGACGTGAGCGTGTTCGGCCAG 

bssyn £ 

synrul.mod u 

550 560 570 580 590 600 

****** 

synlT.mze CGCTGGGGCTTCGACGCCGCCACCATCMCAGCCGCTACAACGACCTGACCCGCCTGATC 



Fig. 3B 



610 620 630 640 650 660 

****** 

synlT . mze GGCMCTACACCGACCACGCCGTGCGCTGGTACAACACCGGCCTGGAGCGCGTGTGGGGC 

bssyn ♦ T 

synrul.mod . T 

670 680 690 700 710 720 

****** 

synlT.mze CCCGACAGCCGCGACTGGATCCGCTACAACCAGTTCCGCCGCGAGCTGACCCTGACCGTG 

bssyn A.G 

synrul.mod A.G 

730 740 750' 760 770 780 

****** 

synlT.mze CTGGACATCGTGAGCCTGTTCCCCAACTACGACAGCCGCACCTACCCCATCCGCACCGTG 

bssyn 

synrul.mod 

790 800 810 820 830 840 

synlT . mze AGCCAGCTGACCCGCGAGATCTACACCAACCCCGTGCTGGAGAACTTCGACGGCAGCTTC 

bssyn T 

synrul.mod T 

850 860 870 880 890 900 

****** 

synlT . mze CGCGGCAGCGCCCAGGGCATCGAGGGCAGCATCCGCAGCCCCCACCTGATGGACATCCTG 

bssyn 

synrul.mod 

910 920 930 940 950 960 

* * * * * * 

synlT . mze AACAGCATCACCATCTACACCGACGCCCACCGCGGCGAGTACTACTGGAGCGGCCACCAG 

bssyn 

synrul.mod 

970 980 990 1000 1010 1020 

synlT.mze ATCATGGCCAGCCCCGTGGGCTTCAGCGGCCCCGAGTTCACCTTCCCCCTGTACGGCACC 

bssyn C 

synful.mod C 

1030 1040 1050 1060 1070 1080 

synlT.mze ATGGGCAACGCCGCCCCCCAGCAGCGCATCGTGGCCCAGCTGGGCCAGGGCGTGTACCGC 

bssvn T..A..T A A 

synful.mod T..A..T A A 

1090 1100 1110 1120 1130 1140 

synlT . mze ACCCTGAGCAGCACCCTGTACCGCCGCCCCTTCAACATCGGCATCAACAACCAGCAGCTG 

bssyn ? ,,A,, I 

synrul.mod 1..A..1 

1150 1160 1170 1180 1190 1200 

* * * * 

synlT . mze AGCGTGCTGGACGGCACCGAGTTCGCCTACGGCACCAGCAGCAACCTGCCCAGCGCCGTG 
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1210 1220 1230 1240 1250 1260 

****** 

synlT . mze TACCGCAAGAGCGGCACCGTGGACAGCCTGGACGAGATCCCCCCCCAGMCMCMCGTG 

bssyn T 

synrul.mod T 

1270 1280 1290 1300 1310 1320 

****** 

synlT . mze CCCCCCCGCCAGGGCTTCAGCCACCGCCTGAGCCACGTGAGCATGTTCCGCAGCGGCTTC 

bss yn . . A . . T . . A. T ...♦..*♦. . T • • • • • • 

synful.mod ..A..T..A T T 

1330 1340 1350 • 1360 1370 1380 

* * * * * * 

synlT . mze AGCAACAGCAGCGTGAGCATCATCCGCGCCCCCATGTTCAGCTGGATCCACCGCAGCGCC 

bss vn ....•T..A..T • T. .T t • • 

synful.mod T..A..T T T... 

1390 1400 1410 1420 1430 1440 

****** 

s ynlT . mze GAGTTCAACAACATCATCCCCAGCAGCCAGATCACCCAGATCCCCCTGACCAAGAGCACC 

bssyn 

synful.mod • . . * 

1450 1460 1470 1480 1490 1500 

****** 

synlT.mze AACCTGGGCAGCGGCACCAGCGTGGTGAAGGGCCCCGGCTTCACCGGCGGCGACATCCTG 

bssyn • 

synrul.mod 

1510 1520 1530 1540 1550 1560 

****** 

synlT.mze CGCCGCACCAGCCCCGGCCAGATCAGCACCCTGCGCGTGAACATCACCGCCCCCCTGAGC 

bssyn 

synrul.mod . . . . ; 

1570 1580 1590 1600 1610 1620 

****** 
synlT . mze CAGCGCTACCGCGTGCGCATCCGCTACGCCAGCACCACCAACCTGCAGTTCCACACCAGC 

bssyn C 

synrul.mod C 

1630 1640 1650 1660 1670 1680 

****** 

synlT.mze ATCGACGGCCGCCCCATCAACCAGGGCAACTTCAGCGCCACCATGAGCAGCGGCAGCAAC 

bssyn • 

synful.mod • 

1690 1700 1710 1720 1730 1740 

****** 

synlT . mze CTGCAGAGCGGCAGCTTCCGCACCGTGGGCTTCACCACCCCCTTCAACTTCAGCAACGGC 

bssyn • 

synrul.mod * 

1750 1760 1770 1780 1790 1800 
****** 

synlT . mze AGCAGCGTGTTCACCCTGAGCGCCCACGTGTTCAACAGCGGCAACGAGGTGTACATCGAC 

bssyn 

synrul.mod 
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1810 1820 1830 1840 1850 1860 

****** 

synlT . mze CGCATCGAGTTCGTGCCCGCCGAGGTGACCTTCGAGGCCGAGTACGACCTGGAGCGCGCC 

bssyn • a'pt 

synful.mod A.b..i 

1870 1880 1890 1900 1910 1920 

****** 

synlT . mze CAGAAGGCCGTGAACGAGCTGTTCACCAGCAGCAACCAGATCGGCCTGAAGACCGACGTG 

bssyn 

synrul.mod 

1930 1940 1950 - 1960 1970 1980 
****** 
synlT.mze ACCGACTACCACATCGACCAGGTGAGCMCCTGGTGGAGTGCCTGAGCGACGAGTTCTGC 
bssyn T - 

synrul.mod T 

1990 2000 2010 2020 2030 2040 

synlT . mze CTGGACGAGAAGMGGAGCTGAGCGAGAAGGTGAAGCACGC^ 
bssyn 

synrul.mod 

2050 2060 2070 2080 2090 2100 

synlT . mze CGCMCCTGCTGCAGGACCCCMCTTCCGCGGCATCMCCGCCAGCT^ 
bssyn , 

synrul.mod 

2110 2120 2130 2140 2150 2160 

synlT.mze CGCGGCAGCACCGACATCACCATCCAGGGCGGCGACGACGTGTTCAA 
bssyn 

synful.mod * 

2170 2180 2190 2200 2210 2220 

synlT . mze ACCCTGCTGGGCACCTTCGACGAGTGCTACCCCACCTA 
bssyn 

synful.mod 

2230 2240 2250 2260 2270 2280 

synlT . mze AGCMGCTGMGGCCTACACCCGCTACCAGCT 
bssyn 

synrul.mod 

2290 2300 2310 2320 2330 2340 

synlT . mze CTGGAGATCTACCTGATCCGCTACMCGCCM 
bssyn 

synful.mod 

2350 2360 2370 2380 2390 2400 

synlT.mze GGCAGCCTGTGGCCCCTGAGCGCCCCCAGCCCCATC 



Fig. 3E 

2410 2420 2430 2440 2450 2460 

****** 

synlT . mze CACTTCAGCCTGGACATCGACGTGGGCTGCACCGACCTGAACGAGGACCTGGGCGTGTGG 

bssyn 

synful.mod 

2470 2480 2490 2500 2510 2520 

synlT.mze GTGATCTTCMGATCAAGACCCAGGACGGCGACGCCCGCCTGGGCAACCTGGA^ 
bssyn 

synful.mod 

2530 2540 2550 = 2560 2570 2580 

****** 

synlT . mze GAGGAGAAGCCCCTGGTGGGCGAGGCCCTGGCCCGC^ 
bssyn 

synful.mod . 

2590 2600 2610 2620 2630 2640 

* * * * * * 

synlT . mze CGCGACMGCGCGAGAAGCTGGAGTGGGAGACCMCATCGTGTACM 
bssyn 

synful.mod - 

2650 2660 2670 2680 2690 2700 

synlT . mze AGCGTGGACGCCCTGTTCGTGMCAGCCAGTA 
bssyn 

synful.mod 

2710 2720 2730 2740 2750 2760 

synlT.mze GCCATGATCCACGCCGCCGACMGCGCGTGCACAGCATC 
bssyn " m 

synful.mod » 1 

2770 2780 2790 2800 2810 2820 

x 4- * * x « 



synlT . mze CTGAGCGTGATCCCCGGCGTGAACGCCGCCATCTTCGAGGAGCT 
bssyn 

synful.mod 

2830 2840 2850 2860 2870 2880 

synlT.mze ACCGCCTTCAGCCTGTACGACGCCCGCMCGTGATCAAGMCGGCGACT 
bssyn 1 

synful.mod 

2890 2900 2910 2920 2930 2940 

synlT.mze CTGAGCTGCTGGMCGTGMGGGCCACGTGGACGTC 

bssyn 

synful .mod • • • • * * 

2950 2960 2970 2980 2990 3000 

synlT.mze GTGCTGGTGGTGCCCGAGTGGGAGGCCGAGGTGAGCCAGGAGGTGCGCGTGTC 

bssyn 

synful.mod 
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3010 3020 3030 3040 3050 3060 
****** 
synlT.mze CGCGGCTACATCCTGCGCGTGACCGCCTACAAGGAGGGCTACGGCGAGGGCTGCGTGACC 

bssyn 

synful.mod 

3070 3080 3090 3100 3110 3120 

*)c ^ ^ 

synlT . mze ATCCACGAGATCGAGAACAACACCGACGAGCTGAAGTTCAGCAACTGCGTGGAGGAGGAG 
bssyn " 

synful.mod u 

3130 3140 3150 3160 3170 3180 

synlT.mze GTGTACCCCAACMCACCGTGACCTGCMCGACTACACCGCCACCCAGGAG^ 
bssyn 

synful.mod 

3190 3200 3210 3220 3230 3240 
synlT.mze GGCACCTACACCAGCCGCAACCGCGGCTACGACGGCGCCTACGAGAGCAACA^ 
bssyn 

synful.mod 

3250 3260 . 3270 3280 3290 3300 

synlT.mze CCCGCCGACTACGCCAGCGCCTACGAGGAGMGGCCTACACC^ 

bssyn 

synful .mod 

3310 3320 3330 3340 3350 3360 

synlT . mze CCCTGCGAGAGCAACCGCGGCTACGGCGACTACAC^ 
bssyn 

synful.mod 

3370 3380 3390 3400 3410 3420 

synlT.mze AAGGAGCTGGAGTACTTCCCCGAGACCGACMGGTGTGGATCGAGATC^ 
bssyn , 

synful.mod 

3430 3440 3450 3460 

synlT .mze GGCACCTTCATCGTGGACAGCGTGGAGCTGCTGCTGATGGAGGAGTAG 

bssyn " ' " ' " 

synful.mod 
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10 20 30 40 50 60 

****** 

ATGGATMCMTCCGMCATCAATGAATGCATTCCTTATMTTGTTTMGTMCCCTGM 
♦••#tC»»»»iC»»A»»t»»»t«C«»»t««»»»t»A»«C»tCt«C»»G»ti«««»»A««» 
C C..C C..G C..C..C..C..CC.G..C C..G 

70 80 90 100 110 120 

****** 

GTAGAAGTATTAGGTGGAGAAAGAATAGAAACTGGTTACACCCCAATCGATATTTCCTTG 

. ,T C.T C . C * . T C .T..C Q • • a • a • • 

..G..G..GC.G..C..C..GC.C..C..G..C..C C C..CAG.C. . 

130 140 150 • 160 170 180 

TCGCTAACGCAATTTCTTTTGAGTGAATTTGTTCCCGGTGCTGGATTTGTGTTAGGACTA 

. .CT.G. .A. .G GC.C..C. .G. .C..G. .A G. .C. .TC.C. . . . . . 

Agc.:g::c..g..c..gc....c..g..c..g C..C..C..C...C.G..C..G 

190 200 210 220 230 240 

GTTGATATAATATGGGGAATTTTTGGTCCCTCTCAATGGGACGCATTTCTTGTACAAATT 

C..C..C T..C A T.....C..G..G...... 

i .G..C. .C.C C..C..C..C...AGC. .G C..C..G..G..G..C 

250 260 270 280 290 300 

GAACAGTTAATTAACCAAAGAATAGAAGAATTCGCTAGGAACCAAGCCATTTCTAGATTA 

. .G G..C G..G..C. . ..G C... G C.. . ..G..G 

. .G. . .C.G. .C GC.C. .C. .G. .G CC.C G CAGCC.CC.G 

310 320 330 340 350 360 

GAAGGACTAAGCAATCTTTATCAAATTTACGCAGAATCTTTTAGAGAGTGGGAAGCAGAT 

T.G C. .C C. .T. . . . .GAGC..C. r"r"r 

. .G. .C. .G C. .G. .C. i . . .C C. .GAGC. .CC.C G. .C. .C 

370 380 390 400 410 420 

CCTACTAATCCAGCATTAAGAGAAGAGATGCGTATTCAATTCAATGACATGAACAGTGCC 

C TC.CC.C..G..A ,. C C... 

..C..C..C..C..CC.GC.C..G C. .C.G C C... 

430 440 450 460 470 480 

CTTACMCCGCTATTCCTCTTTTTGCAGTTCAAAATTATCAAGTTCCTCTTTTATCAGTA 

490 500 510 520 530 540 

TATGTTCAAGCTGCAAATTTACATTTATCAGTTTTGAGAGATGTTTCAGTGTTTGGACAA 
C A. .T. . .C.T. .CC.CAGC. .GC.TC C. . .AGC .G. . . 

::c::g::g::c..c..cc.g..cc.gagc..gc..c.c..c.cagc c.c.g 

550 560 570 580 590 600 

aggtggggatttgatgccgcgactatcmtagtcgttatmtgatttaactaggcttatt 
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610 
* 



620 
* 



630 
* 



640 
* 



650 
* 



660 
* 



GGCAACTATACAGATCATGCTGTACGCTGGTACAATACGGGATTAGAGCGTGTATGGGGA 
■ • A ■ • • • • C • • C • • C • • C • • • • • T * » T • • • • * •• • C • • T • • C • ♦ G • * • • • • C ♦ • • • • T 
C. .C. .C. .C. .C. .G C. .C. .CC.G C. .G T 



670 
* 



680 
* 



690 



700 
* 



710 
* 



720 
* 



CCGGATTCTAGAGATTGGATAAGATATAATCAATTTAGAAGAGAATTAACACTAACTGTA 

, ,T ( », ttt ,» i ,»»T««»t»C«»C««G».C«»Gt«««»»t«G»»C»iC««A«»T 

. .C. .CAGCC.C. .C C..G. .C. .C. .G. .CC.CC.C. .GC.G. .C. .G. .C. .G 

730 740 750 < 760 770 780 

****** 

TTAGATATCGTTTCTCTATTTCCGAACTATGATAGTAGAACGTATCCAATTCGAACAGTT 

..G..C..T..G C..C CTCC C. .C. .T. .C. .T. . . . .G 

C.G..C GAGC..G..C..C C. .C. .CC.C. .C. .C. .C. .C. .C. .C. .G 



830 
* 



840 
* 



790 800 810 820 

* * * * 

TCCCAATTAACAAGAGAAATTTATACAAACCCAGTATTAGAAAATTTTGATGGTAGTTTT 

C.T..C C T TC.T..G..C..C..C.....C..C 

AG GC.G.. CC.C. G C..C C. .GC.G. .G. .C. .C. .C. .C. .C. .C 



890 
* 



900 
* 



850 860 870 880 

CGAGGCTCGGCTCAGGGCATAGAAGGAAGTATTAGGAGTCCACATTTGATGGATATACTT 

..T..T..T..C..A..T..C CTCC. .C C . . . .C C. .CT.G 
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64 ATGGACCTGC TGCCCGACGC CCGCATCGAG GACAGCCTGT GCATCGCCGA GGGCAACAAC 
MetAspLeu LeuProAsp AlaArglleGlu AspSerLeu CysIleAla GluGlyAsnAsn 

124 ATCGACCCCT TCGTGAGCGC CAGCACCGTG CAGACCGGCA TCAACATCGC CGGCCGCATC 
IleAspPro PheValSer AlaSerThrVal GlnThrGly IleAsnlle AlaGlyArglle 

184 CTGGGCGTGC TGGGCGTGCC CTTCGCCGGC CAGCTGGCCA GCTTCTACAG CTTCCTGGTG 
LeuGlyVal LeuGlyVal ProPheAlaGly GlnLeuAla SerPheTyr SerPheLeuVal 

244 GGCGAGCTGT GGCCCCGCGG CCGCGACCAG TGGGAGATCT TCCTGGAGCA CGTGGAGCAG 
GlyGluLeu TrpProArg GlyArgAspGln TrpGluIle PheLeuGlu HisValGluGln 

304 CTGATCAACC AGCAGATC AC CGAGAACGCC CGCAACACCG CCCTGGCCCG CCTGCAGGGC 
LeuIleAsn GlnGlnlle ThrGluAsnAla ArgAsnThr AlaLeuAla ArgLeuGlnGly 

364 CTGGGCGACA GCTTCCGCGC CTACCAGCAG AGCCTGGAGG ACTGGCTGGA GAACCGCGAC 
LeuGlyAsp SerPheArg AlaTyrGlnGln SerLeuGlu AspTrpLeu GluAsnArgAsp 

424 GACGCCCGCA CCCGCAGCGT GCTGTACACC CAGTACATCG CCCTGGAGCT GGACTTCCTG 
AspAlaArg ThrArgSer ValLeuTyrThr GlnTyrlle AlaLeuGlu LeuAspPheLeu 

4fi4 AACGCCATGC CCCTGTTCGC CATCCGCAAC CAGGAGGTGC CCCTGCTGAT GGTGTACGCC 
AsnAlaMet ProLeuPhe AlalleArgAsn GlnGluVal ProLeuLeu MetValTyrAla 

544 CAGGCCGCCA ACCTGCACCT. GCTGCTGCTG CGCGACGCCA GCCTGTTCGG CAGCGAGTTC 
GlnAlaAla AsnLeuHis LeuLeuLeuLeu ArgAspAla SerLeuPhe GlySerbluFne 

fin & rrrr<?rACCA GCTAGGAGAT CCAGCGCTAC TACGAGCGCC AGGTGGAGCG CACCCGCGAC 
604 GlySSTh?S^^Ile^Sk?gTyr TyrGluArg GlnValGlu ArgThrArgAsp 

664 ^^^s^a^ «l g «^^^^ 

724 SS^M^ 

784 CTGGTGGCCC TGTTCCCCAG CTACGACACC CGCACCTACC CCATCAACAC CAGCGCCCAG 
LeuValAla LeuPhePro SerTyrAspThr ArgThrTyr ProIleAsn ThrSerAlaGin 

844 CTGACCCGCG AGGTGTACAC CGACGCCATC GGCGCCACCG GCGTGAACAT GGCCAGCATG 
LeuThrArg GluValTyr ThrAspAlalle GlyAlaThr GlyValAsn MetAlaSerMet 

904 AACTGGTACA ACAACAACGC CCCCAGCTTC AGCGCCATCG AGGCCGCCGC CATCCGCAGC 
AsnTrpTyr AsnAsnAsn AlaProSerPhe SerAlalle GluAlaAla AlalleArgber 

Qfi4 CCCCACCTGC TGGACTTCCT GGAGCAGCTG ACCATCTTCA GCGCCAGCAG CCGCTGGAGC 
ProHisLeu LeuAspPhe LeuGluGlnLeu ThrllePhe SerAlaSer SerArgTrpSer 

m?4 AACACCCGCC ACATGACCTA CTGGCGCGGC CACACCATCC AGAGCCGCCC CATCGGCGGC 
AsnThrArg HisMetThr TyrTrpArgGly HisThrlle GlnSerArg ProIleGlyGly 
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1084 GGCCTGAACA CCAGCACCCA CGGCGCCACC AACACCAGCA TCAACCCCGT GACCCTGCGC 
GlyLeuAsn ThrSerThr HisGlyAlaThr AsnThrSer IleAsnPro ValThrLeuArg 

1144 TTCGCCAGCC GCGACGTGTA CCGCACCGAG AGCTACGCCG GCGTGCTGCT GTGGGGCATC 
PheAlaSer ArgAspVal TyrArgThrGlu SerTyrAla GlyValLeu LeuTrpGlylle 

1204 TACCTGGAGC CCATCCACGG CGTGCCCACC GTGCGCTTCA ACTTCACCAA CCCCCAGAAC 
TyrLeuGlu ProIleHis GlyValProThr ValArgPhe AsnPheThr AsnProGlnAsn 

1264 ATCAGCGACC GCGGCACCGC CAACTACAGC CAGCCCTACG AGAGCCCCGG CCTGCAGCTG 
IleSerAsp ArgGlyThr AlaAsnTyrSer GlnProTyr GluSerPro GlyLeuGlnLeu 

1324 AAGGACAGCG AGACCGAGCT GCCCCCCGAG ACCACCGAGC GCCCCAACTA CGAGAGCTAC 
LysAspSer GluThrGlu LeuProProGlu ThrThrGlu ArgProAsn TyrGluSerTyr 

1384 AGCCACCGCC TGAGCCACAT CGGCATCATC CTGCAGAGCC GCGTGAACGT GCCCGTGTAC 
SerHisArg LeuSerHis IleGlyllelle LeuGlnSer ArgValAsn ValProValTyr 

1444 AGCTGGACCC ACCGCAGCGC CGACCGCACC AACACCATCG GCCCCAACCG CATCACCCAG 
SerTrpThr HisArgSer AlaAspArgThr AsnThrlle GlyProAsn ArglleThrGln 

1504 ATCCCCATGG TGAAGGCCAG CGAGCTGCCC CAGGGCACCA CCGTGGTGCG CGGCCCCGGC 
IleProMet ValLysAla SerGluLeuPro GlnGlyThr ThrValVal ArgGlyProGly 

1564 TTCACCGGCG GCGACATCCT GCGCCGCACC AACACCGGCG GCTTCGGCCC CATCCGCGTG 
PheThrGly GlyAspIle LeuArgArgThr AsnThrGly GlyPheGly ProIleArgVal 

1624 ACCGTGAACG GCCCCCTGAC CCAGCGCTAC CGCATCGGCT TCCGCTACGC CAGCACCGTG 
ThrValAsn GlyProLeu ThrGlnArgTyr ArglleGly PheArgTyr AlaSerThrVal 

1684 GACTTCGACT TCTTCGTGAG CCGCGGCGGC ACCACCGTGA ACAACTTCCG CTTCCTGCGC 
AspPheAsp PhePheVal SerArgGlyGly ThrThrVal AsnAsnPhe ArgPheLeuArg 

1744 ACCATGAACA GCGGCGACGA GCTGAAGTAC GGCAACTTCG TGCGCCGCGC CTTCACCACC 
ThrMetAsn SerGlyAsp GluLeuLysTyr GlyAsnPhe ValArgArg AlaPheThrThr 

1804 CCCTTCACCT TCACCCAGAT CCAGGACATC ATCCGCACCA GCATCCAGGG CCTGAGCGGC 
ProPheThr PheThrGln IleGlnAspIle IleArgThr SerlleGln GlyLeuSerGly 

1864 AACGGCGAGG TGTACATCGA CAAGATCGAG ATCATCCCCG TGACCGCCAC CTTCGAGGCC 
AsnGlyGlu ValTyrlle AspLysIleGlu IlellePro ValThrAla ThrPheGluAla 

1924 GAGTACGACC TGGAGCGCGC CCAGGAGGCC GTGAACGCCC TGTTCACCAA CACCAACCCC 
GluTyrAsp LeuGluArg AlaGlnGluAla ValAsnAla LeuPheThr AsnThrAsnPro 

1984 CGCCGCCTGA AGACCGACGT GACCGACTAC CACATCGACC AGGTGAGCAA CCTGGTGGCC 
ArgArgLeu LysThrAsp ValThrAspTyr HisIleAsp GlnValSer AsnLeuValAla 

2044 TGCCTGAGCG ACGAGTTCTG CCTGGACGAG AAGCGCGAGC TGCTGGAGAA GGTGMGTAC 
CysLeSser AspGluPhe CysLeuAspGlu LysArgGlu LeuLeuGlu LysValLysTyr 
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2104 GCCMGCGCC TGAGCGACGA GCGCMCCTG CTGCAGGACC CCMCTTCAC CAGCATCMC 
AlaLysArg LeuSerAsp GluArgAsnLeu LeuGlnAsp ProAsnPhe ThrSerlleAsn 

2164 AAGCAGCCCG ACTTCATCAG CACCAACGAG CAGAGCAACT TCACCAGCAT CCACGAGCAG 
LysGlnPro AspPhelle SerThrAsnGlu GlnSerAsn PheThrSer IleHisGluGln 

2224 AGCGAGCACG GCTGGTGGGG CAGCGAGAAC ATCACCATCC AGGAGGGCAA CGACGTGTTC 
SerGluHis GlyTrpTrp GlySerGluAsn IleThrlle GlnGluGly AsnAspValPhe 

2284 AAGGAGAACT ACGTGACCCT GCCCGGCACC TTCAACGAGT GCTACCCCAC CTACCTGTAC 
LysGluAsn TyrValThr LeuProGlyThr PheAsnGlu CysTyrPro ThrTyrLeuTyr 

2344 CAGAAGATCG GCGAGAGCGA GCTGAAGGCC TACACCCGCT ACCAGCTGCG CGGCTACATC 
GlnLysIle GlyGluSer GluLeuLysAla TyrThrArg TyrGlnLeu ArgGlyTyrlle 

2404 GAGGACAGCC AGGACCTGGA GATCTACCTG ATCCGCTACA ACGCCAAGCA CGAGACCCTG 
GluAspSer GlnAspLeu GluIleTyrLeu IleArgTyr AsnAlaLys HisGluThrLeu 

2464 GACGTGCCCG GCACCGAGAG CCTGTGGCCC CTGAGCGTGG AGAGCCCCAT CGGCCGCTGC 
AspValPro GlyThrGlu SerLeuTrpPro LeuSerVal GluSerPro IleGlyArgCys 

2524 GGCGAGCCCA ACCGCTGCGC CCCCCACTTC GAGTGGAACC CCGACCTGGA CTGCAGCTGC 
GlyGluPro AsnArgCys AlaProHisPhe GluTrpAsn ProAspLeu AspCysSerCys 

2584 CGCGACGGCG AGAAGTGCGC CCACCACAGC CACCACTTCA GCCTGGACAT CGACGTGGGC 
ArgAspGly GluLysCys AlaHisHisSer HisHisPhe SerLeuAsp IleAspValGly 

2644 TGCACCGACC TGCACGAGAA CCTGGGQGTG TGGGTGGTGT TCAAGATCAA GACCCAGGAG 
CysThrAsp LeuHisGlu AsnLeuGlyVal TrpValVal PheLysIle LysThrGlnGlu 

2704 GGCCACGCCC GCCTGGGCAA CCTGGAGTTC ATCGAGGAGA AGCCCCTGCT GGGCGAGGCC 
GlyHisAla ArgLeuGly AsnLeuGluPhe IleGluGlu LysProLeu LeuGlyGluAla 

2764 CTGAGCCGCG TGAAGCGCGC CGAGAAGAAG TGGCGCGACA AGCGCGAGAA GCTGCAGCTG 
LeuSerArg ValLysArg AlaGluLysLys TrpArgAsp LysArgGlu LysLeuGlnLeu 

2824 GAGACCAAGC GCGTGTACAC CGAGGCCAAG GAGGCCGTGG ACGCCCTGTT CGTGGACAGC 
GluThrLys ArgValTyr ThrGluAlaLys GluAlaVal AspAlaLeu PheValAspSer 

2884 CAGTACGACC GCCTGCAGGC CGACACCAAC ATCGGCATGA TCCACGCCGC CGACAAGCTG 
GlnTyrAsp ArgLeuGln AlaAspThrAsn IleGlyMet IleHisAla AlaAspLysLeu 

2944 GTGCACCGCA TCCGCGAGGC CTACCTGAGC GAGCTGCCCG TGATCCCCGG CGTGAACGCC 
ValHisArg IleArgGlu AlaTyrLeuSer GluLeuPro ValllePro GlyValAsnAla 

3004 GAGATCTTCG AGGAGCTGGA GGGCCACATC ATCACCGCCA TCAGCCTGTA CGACGCCCGC 
GluIlePhe GluGluLeu GluGlyHisIle IleThrAla IleSerLeu TyrAspAlaArg 
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3064 AACGTGGTGA AGAACGGCGA CTTCMCMC 
AsnValVal LysAsnGly AspPheAsnAsn 

3124 GTGGACGTGC AGCAGAGCCA CCACCGCAGC 
ValAspVal GlnGlnSer HisHisArgSer 

3184 GTGAGCCAGG CCGTGCGCGT GTGCCCCGGC 
ValSerGln AlaValArg ValCysProGly 

3244 AAGGAGGGCT ACGGCGAGGG CTGCGTGACC 
LysGluGly TyrGlyGlu GlyCysValThr 

3304 CTGAAGTTCA AGAACCGCGA GGAGGAGGAG 
LeuLysPhe LysAsnArg GluGluGluGlu 

3364 GACTACACCG CCCACCAGGG CACCGCCGGC 
AspTyrThr AlaHisGln GlyThrAlaGly 

3424 GGCTACGAGG ACGCCTACGA GGTGGACACC 
GlyTyrGlu AspAlaTyr GluValAspThr 

3484 GAGGAGGAGA CCTACACCGA CGTGCGCCGC 
GluGluGlu ThrTyrThr AspValArgArg 

3544 GTGAACTACC CCCCCGTGCC CGCCGGCTAC 
ValAsnTyr ProProVal ProAlaGlyTyr 

3604 ACCGACACCG TGTGGATCGA GATCGGCGAG 
ThrAspThr ValTrpIle GluIleGlyGlu 

3664 GAGCTGCTGC TGATGGAGGA GTAG 
GluLeuLeu LeuMetGlu Glu— 



GGCCTGACCT GCTGGAACGT GAAGGGCCAC 
GlyLeuThr CysTrpAsn ValLysGlyHis 

GACCTGGTGA TCCCCGAGTG GGAGGCCGAG 
AspLeuVal IleProGlu TrpGluAlaGlu 

TGCGGCTACA TCCTGCGCGT GACCGCCTAC 
CysGlyTyr IleLeuArg ValThrAlaTyr 

ATCCACGAGA TCGAGAACAA CACCGACGAG 
IleHisGlu IleGluAsn AsnThrAspGlu 

GTGTACCCCA CCGACACCGG CACCTGCAAC 
ValTyrPro ThrAspThr GlyThrCysAsn 

TGCGCCGACG CCTGCAACAG CCGCAACGCC 
CysAlaAsp AlaCysAsn SerArgAsnAla 

ACCGCCAGCG TGAACTACM GCCCACCTAC 
ThrAlaSer ValAsnTyr LysProThrTyr 

GACAACCACT GCGAGTACGA CCGCGGCTAC 
AspAsnHis CysGluTyr AspArgGlyTyr 

GTGACCAAGG AGCTGGAGTA CTTCCCCGAG 
ValThrLys GluLeuGlu TyrPheProGlu 

ACCGAGGGCA AGTTCATCGT GGACAGCGTG 
ThrGluGly LysPhelle ValAspSerVal 
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SEQUENCE OF THE FULL-LENGTH HYBRID SYNTHETIC/NATIVE CRYIA(B) CHIMERIC GENE 
The fusion point between the synthetic and native coding sequences is 
indicated by a slash (/) in the sequence. 

1 ATGGACAACA ACCCCAACAT CAACGAGTGC ATCCCCTACA ACTGCCTGAG CAACCCCGAG 
MetAspAsn AsnProAsn IleAsnGluCys IleProTyr AsnCysLeu SerAsnProGlu 

61 GTGGAGGTGC TGGGCGGCGA GCGCATCGAG ACCGGCTACA CCCCCATCGA CATCAGCCTG 
ValGluVal LeuGlyGly GluArglleGlu ThrGlyTyr ThrProIle AspIleSerLeu 

121 AGCCTGACCC AGTTCCTGCT GAGCGAGTTC GTGCCCGGCG CCGGCTTCGT GCTGGGCCTG 
SerLeuThr GlnPheLeu LeuSerGluPhe ValProGly AlaGlyPhe ValLeuGlyLeu 

181 GTGGACATCA TCTGGGGCAT CTTCGGCCCC AGCCAGTGGG ACGCCTTCCT GGTGCAGATC 
ValAspIle IleTrpGly IlePheGlyPro SerGlnTrp AspAlaPhe LeuValGlnlle 

241 GAGCAGCTGA TCAACCAGCG CATCGAGGAG TTCGCCCGCA ACCAGGCCAT CAGCCGCCTG 
GluGlnLeu IleAsnGln ArglleGluGlu PheAlaArg AsnGlnAla IleSerArgLeu 

301 GAGGGCCTGA GCAACCTGTA CCAAATCTAC GCCGAGAGCT TCCGCGAGTG GGAGGCCGAC 
GluGlyLeu SerAsnLeu TyrGlnlleTyr AlaGluSer PheArgGlu TrpGluAlaAsp 

361 CCCACCAACC CCGCCCTGCG CGAGGAGATG CGCATCCAGT TCAACGACAT GAACAGCGCC 
ProThrAsn ProAlaLeu ArgGluGluMet ArglleGln PheAsnAsp MetAsnSerAla 

421 CTGACCACCG CCATCCCCCT GTTCGCCGTG CAGAACTACC AGGTGCCCCT GCTGAGCGTG 
LeuThrThr AlallePro LeuPheAlaVal GlnAsnTyr GlnValPro LeuLeuSerVal 

481 TACGTGCAGG CCGCCAACCT GCACCTGAGC GTGCTGCGCG ACGTCAGCGT GTTCGGCCAG 
TyrValGln AlaAlaAsn LeuHisLeuSer ValLeuArg AspValSer ValPheGlyGln 

541 CGCTGGGGCT TCGACGCCGC CACCATCAAC AGCCGCTACA ACGACCTGAC CCGCCTGATC 
ArgTrpGly PheAspAla AlaThrlleAsn SerArgTyr AsnAspLeu ThrArgLeuIle 

601 GGCAACTACA CCGACCACGC CGTGCGCUGG TACAACACCG GCCTGGAGCG CGTGTGGGGT 
GlyAsnTyr ThrAspHis AlaValArgTrp TyrAsnThr GlyLeuGlu ArgValTrpGly 

661 CCCGACAGCC GCGACTGGAT CAGGTACAAC CAGTTCCGCC GCGAGCTGAC CCTGACCGTG 
ProAspSer ArgAspTrp IleArgTyrAsn GlnPheArg ArgGluLeu ThrLeuThrVal 

721 CTGGACATCG TGAGCCTGTT CCCCAACTAC GACAGCCGCA CCTACCCCAT CCGCACCGTG 
LeuAspIle ValSerLeu PheProAsnTyr AspSerArg ThrTyrPro IleArgThrVal 

781 AGCCAGCTGA CCCGCGAGAT TTACACCAAC CCCGTGCTGG AGAACTTCGA CGGCAGCTTC 
SerGlnLeu ThrArgGlu IleTyrThrAsn ProValLeu GluAsnPhe AspGlySerPhe 

841 CGCGGCAGCG CCCAGGGCAT CGAGGGCAGC ATCCGCAGCC CCCACCTGAT GGACATCCTG 
ArgGlySer AlaGlnGly IleGluGlySer IleArgSer ProHisLeu MetAspIleLeu 

901 AACAGCATCA CCATCTACAC CGACGCCCAC CGCGGCGAGT ACTACTGGAG CGGCCACCAG 
AsnSerlle ThrlleTyr ThrAspAlaHis ArgGlyGlu TyrTyrTrp SerGlyHisGln 
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961 ATCATGGCCA GCCCCGTCGG CTTCAGCGGC 
IleMetAla SerProVal GlyPheSerGly 

1021 ATGGGCAACG CTGCACCTCA GCAGCGCATC 
MetGlyAsn AlaAlaPro GlnGlnArglle 

1081 ACCCTGAGCA GCACCCTGTA CCGTCGACCT 
ThrLeuSer SerThrLeu TyrArgArgPro 

1141 AGCGTGCTGG ACGGCACCGA GTTCGCCTAC 
SerValLeu AspGlyThr GluPheAlaTyr 

1201 TACCGCAAGA GCGGCACCGT GGACAGCCTG 
TyrArgLys SerGlyThr ValAspSerLeu 

1261 CCACCTCGAC AGGGCTTCAG CCACCGTCTG 
ProProArg GlnGlyPhe SerHisArgLeu 

1321 AGCAACAGCA GCGTGAGCAT CATCCGTGCA 
SerAsnSer SerValSer IlelleArgAla 

1381 GAGTTCAACA ACATCATCCC CAGCAGCCAG 
GluPheAsn Asnllelle ProSerSerGln 

1441 AACCTGGGCA GCGGCACCAG CGTGGTGAAG 
AsnLeuGly SerGlyThr SerValValLys 

1501 CGCCGCACCA GCCCCGGCCA GATCAGCACC 
ArgArgThr SerProGly GlnlleSexThr 

1561 CAGCGCTACC GCGTCCGCAT CCGCTACGCC 
GlnArgTyr ArgValArg IleArgTyrAla 

1621 ATCGACGGCC GCCCCATCAA CCAGGGCAAC 
IleAspGly ArgProIle AsnGlnGlyAsn 

1681 CTGCAGAGCG GCAGCTTCCG CACCGTGGGC 
LeuGlnSer GlySerPhe ArgThrValGly 

1741 AGCAGCGTGT TCACCCTGAG CGCCCACGTG 
SerSerVal PheThrLeu SerAlaHisVal 

1801 CGCATCGAGT TCGTGCCCGC CGAGGTGACC 
ArglleGlu PheValPro AlaGluValThr 

18 61 CAGAAGGCCG TGAACGAGCT GTTCACCAGC 
GlnLysAla ValAsnGlu LeuPheThrSer 



CCCGAGTTCA CCTTCCCCCT GTACGGCACC 
PrqGluPhe ThrPhePro LeuTyrGlyThr 

GTGGCACAGC TGGGCCAGGG AGTGTACCGC 
ValAlaGln LeuGlyGln GlyValTyrArg 

TTCAACATCG GCATCAACAA CCAGCAGCTG 
PheAsnlle GlylleAsn AsnGlnGlnLeu 

GGCACCAGCA GCAACCTGCC CAGCGCCGTG 
GlyThrSer SerAsnLeu ProSerAlaVal 

GACGAGATCC CCCCTCAGAA CAACAACGTG 
AspGluIle ProProGln AsnAsnAsnVal 

AGCCACGTGA GCATGTTCCG CAGTGGCTTC 
SerHisVal SerMetPhe ArgSerGlyPhe 

CCTATGTTCA GCTGGATTCA CCGCAGTGCC 
ProMetPhe SerTrpIle HisArgSerAla 

ATCACCCAGA TCCCCCTGAC CAAGAGCACC 
IleThrGln IleProLeu ThrLysSerThr 

GGCCCCGGCT TCACCGGCGG CGACATCCTG 
GlyProGly PheThrGly GlyAspIleLeu 

CTGCGCGTGA ACATCACCGC CCCCCTGAGC 
LeuArgVal AsnlleThr AlaProLeuSer 

AGCACCACCA ACCTGCAGTT CCACACCAGC 
SerThrThr AsnLeuGln PheHisThrSer 

TTCAGCGCCA CCATGAGCAG CGGCAGCAAC 
PheSerAla ThrMetSer SerGlySerAsn 

TTCACCACCC CCTTCAACTT CAGCAACGGC 
PheThrThr ProPheAsn PheSerAsnGly 

TTCAACAGCG GCAACGAGGT GTACATCGAC 
PheAsnSer GlyAsnGlu ValTyrlleAsp 

TTCGAGGCCG AGTACGACCT GGAGAGGGCT 
PheGluAla GluTyrAsp LeuGluArgAla 

AGCAACCAGA TCGGCCTGAA GACCGACGTG 
SerAsnGln IleGlyLeu LysThrAspVal 
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1921 ACCGACTACC ACATCGAT/CA AGTATCCAAT TTAGTTGAGT GTTTATCTGATGAATTTTGT 
ThrAspTyr HisIleAsp/GlnValSerAsn LeuValGlu CysLeuSer AspGluPheCys 

1981 CTGGATGAAA AAAMGAATT GTCCGAGAAA GTCAAACATG CGAAGCGACT TAGTGATGAG 
LeuAspGlu LysLysGlu LeuSerGluLys ValLysHis AlaLysArg LeuSerAspGlu 

2041 CGGAATTTAC TTCAAGATCC AAACTTTAGA GGGATCAATA GACAACTAGA CCGTGGCTGG 
ArgAsnLeu LeuGlnAsp ProAsnPheArg GlylleAsn ArgGlnLeu AspArgGlyTrp 

2101 AGAGGAAGTA CGGATATTAC CATCCAAGGA GGCGATGACG TATTCAAAGA GAATTACGTT 
ArgGlySer ThrAspIle ThrlleGlnGly GlyAspAsp ValPheLys GluAsnTyrVal 

2161 ACGCTATTGG GTACCTTTGA TGAGTGCTAT CCAACGTATT TATATCAAAA AATAGATGAG 
ThrLeuLeu GlyThrPhe AspGluCysTyr ProThrTyr LeuTyrGln LysIleAspGlu 

2221 TCGAAATTAA AAGCCTATAC CCGTTACCAA TTAAGAGGGT ATATCGAAGA TAGTCAAGAC 
SerLysLeu LysAlaTyr ThrArgTyrGln LeuArgGly TyrlleGlu AspSerGlnAsp 

2281 TTAGAAATCT ATTTAATTCG CTACAATGCC AAACACGAAA CAGTAAATGT GCCAGGTACG 
LeuGluIle TyrLeuIle ArgTyrAsnAla LysHisGlu ThrValAsn ValProGlyThr 

2341 GGTTCCTTAT GGCCGCTTTC AGCCCCAAGT CCAATCGGAA AATGTGCCCA TCATTCCCAT 
GlySerLeu TrpProLeu SerAlaProSer ProIleGly LysCysAla HisHisSerHis 

2401 CATTTCTCCT TGGACATTGA TGTTGGATGT ACAGACTTAA ATGAGGACTT AGGTGTATGG 
HisPheSer LeuAspIle AspValGlyCys ThrAspLeu AsnGluAsp LeuGlyValTrp 

2461 GTGATATTCA AGATTAAGAC GCAAGATGGC CATGCAAGAC TAGGAAATCT AGAATTTCTC 
ValllePhe LysIleLys ThrGlnAspGly HisAlaArg LeuGlyAsn LeuGluPheLeu 

2521 GAAGAGAAAC CATTAGTAGG AGAAGCACTA GCTCGTGTGA AAAGAGCGGA GAAAAAATGG 
GluGluLys ProLeuVal GlyGluAlaLeu AlaArgVal LysArgAla GluLysLysTrp 

2581 AGAGACAAAC GTGAAAAATT GGAATGGGAA ACAAATATTG TTTATAAAGA GGCAAAAGAA 
ArgAspLys ArgGluLys LeuGluTrpGlu ThrAsnlle ValTyrLys GluAlaLysGlu 

2641 TCTGTAGATG CTTTATTTGT AAACTCTCAA TATGATAGAT TACAAGCGGA TACCAACATC 
SerValAsp AlaLeuPhe ValAsnSerGln TyrAspArg LeuGlnAla AspThrAsnlle 

2701 GCGATGATTC ATGCGGCAGA TAAACGCGTT CATAGCATTC GAGAAGCTTA TCTGCCTGAG 
Ala e He HisAlaAla AspLysArgVal HisSerlle ArgGluAla TyrLeuProGlu 

2761 CTGTCTGTGA TTCCGGGTGT CAATGCGGCT ATTTTTGAAG AATTAGAAGG GCGTATTTTC 
LeuSerVal IleProGly ValAsnAlaAla IlePheGlu GluLeuGlu GlyArgllePhe 

2821 ACTGCATTCT CCCTATATGA TGCGAGAAAT GTCATTAAAA ATGGTGATTT TAATAATGGC 
ThrAlaPhe SerLeuTyr AspAlaArgAsn VallleLys AsnGlyAsp PheAsnAsnGly 
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2881 TTATCCTGCT GGAACGTGAA AGGGCATGTA 
LeuSerCys TrpAsnVal LysGlyHisVal 

2941 GTCCTTGTTG TTCCGGAATG GGAAGCAGAA 
ValLeuVal ValProGlu TrpGluAlaGlu 

3001 CGTGGCTATA TCCTTCGTGT CACAGCGTAC 
ArgGlyTyr IleLeuArg ValThrAlaTyr 

3061 ATTCATGAGA TCGAGAACAA TACAGACGAA 
IleHisGlu IleGluAsn AsnThrAspGlu 

3121 GTATATCCAA ACAACACGGT AACGTGTAAT 
ValTyrPro AsnAsnThr ValThrCysAsn 

3181 GGTACGTACA CTTCTCGTAA TCGAGGATAT 
GlyThrTyr ThrSerArg AsnArgGlyTyr 

3241 CCAGCTGATT ATGCATCAGC CTATGAAGAA 
ProAlaAsp TyrAlaSer AlaTyrGluGlu 

3301 CCTTGTGAAT CTAACAGAGG ATATGGGGAT 
ProCysGlu SerAsnArg GlyTyrGlyAsp 

3361 AAAGAATTAG AGTACTTCCC AGAAACCGAT 
LysGluLeu GluTyrPhe ProGluThrAsp 

3421 GGAACATTCA TCGTGGACAG CGTGGAATTA 
GlyThrPhe IleValAsp SerValGluLeu 



GATGTAGAAG AACAAAACAA CCACCGTTCG 
AspValGlu GluGlnAsn AsnHisArgSer 

GTGTCACAAG AAGTTCGTGT CTGTCCGGGT 
ValSerGln GluValArg ValCysProGly 

AAGGAGGGAT ATGGAGAAGG TTGCGTAACC 
LysGluGly TyrGlyGlu GlyCysValThr 

CTGAAGTTTA GCAACTGTGT AGAAGAGGAA 
LeuLysPhe SerAsnCys ValGluGluGlu 

GATTATACTG CGACTCAAGA AGAATATGAG 
AspTyrThr AlaThrGln GluGluTyrGlu 

GACGGAGCCT ATGAAAGCAA TTCTTCTGTA 
AspGlyAla TyrGluSer AsnSerSerVal 

AAAGCATATA CAGATGGACG AAGAGACAAT 
LysAlaTyr ThrAspGly ArgArgAspAsn 

TACACACCAC TACCAGCTGG CTATGTGACA 
TyrThrPro LeuProAla GlyTyrValThr 

AAGGTATGGA TTGAGATCGG AGAAACGGAA 
LysValTrp IleGluIle GlyGluThrGlu 

CTTCTTATGG AGGAATAA 
LeuLeuMet GluGlu 



Fig. 9A 

1 ATGGACAACA ACCCCAACAT CAACGAGTGC ATCCCCTACA ACTGCCTGAG CAACCCCGAG 
MetAspAsn AsnProAsn IleAsnGluCys IleProTyr AsnCysLeu SerAsnProGlu 

61 GTGGAGGTGC TGGGCGGCGA GCGCATCGAG ACCGGCTACA CCCCCATCGA CATCAGCCTG 
ValGluVal LeuGlyGly GluArglleGlu ThrGlyTyr ThrProIle AspIleSerLeu 

121 AGCCTGACCC AGTTCCTGCT GAGCGAGTTC GTGCCCGGCG CCGGCTTCGT GCTGGGCCTG 
SerLeuThr GlnPheLeu LeuSerGluPhe ValProGly AlaGlyPhe ValLeuGlyLeu 

181 GTGGACATCA TCTGGGGCAT CTTCGGCCCC AGCCAGTGGG ACGCCTTCCT GGTGCAGATC 
ValAspIle IleTrpGly IlePheGlyPro SerGlnTrp AspAlaPhe LeuValGlnlle 

241 GAGCAGCTGA TCAACCAGCG CATCGAGGAG TTCGCCCGCA ACCAGGCCAT CAGCCGCCTG 
GluGlnLeu IleAsnGln ArglleGluGlu PheAlaArg AsnGlnAla IleSerArgLeu 

301 GAGGGCCTGA GCAACCTGTA CCAAATCTAC GCCGAGAGCT TCCGCGAGTG GGAGGCCGAC 
GluGlyLeu SerAsnLeu TyrGlnlleTyr AlaGluSer PheArgGlu TrpGluAlaAsp 

361 CCCACCAACC CCGCCCTGCG CGAGGAGATG CGCATCCAGT TCAACGACAT GMCAGCGCC 
ProThrAsn ProAlaLeu ArgGluGluMet ArglleGln PheAsnAsp MetAsnSerAla 

421 CTGACCACCG CCATCCCCCT GTTCGCCGTG CAGAACTACC AGGTGCCCCT GCTGAGCGTG 
LeuThrThr AlallePro LeuPheAlaVal GlnAsnTyr GlnValPro LeuLeuSerVal 

481 TACGTGCAGG CCGCCAACCT GCACCTGAGC GTGCTGCGCG ACGTCAGCGT GTTCGGCCAG 
TyrValGln AlaAlaAsn LeuHisLeuSer ValLeuArg AspValSer ValPheGlyGln 

541 CGCTGGGGCT TCGACGCCGC CACCATCAAC AGCCGCTACA ACGACCTGAC CCGCCTGATC 
ArgTrpGly PheAspAla AlaThrlleAsn SerArgTyr AsnAspLeu ThrArgLeuIle 

601 GGCAACTACA CCGACCACGC CGTGCGCTGG TACAACACCG GCCTGGAGCG CGTGTGGGGT 
GlyAsnTyr ThrAspHis AlaValArgTrp TyrAsnThr GlyLeuGlu ArgValTrpGly 

661 CCCGACAGCC GCGACTGGAT CAGGTACAAC CAGTTCCGCC GCGAGCTGAC CCTGACCGTG 
ProAspSer ArgAspTrp IleArgTyrAsn GlnPheArg ArgGluLeu ThrLeuThrVal 

721 CTGGACATCG TGAGCCTGTT CCCCAACTAC GACAGCCGCA CCTACCCCAT CCGCACCGTG 
LeuAspIle ValSerLeu PheProAsnTyr AspSerArg ThrTyrPro IleArgThrVal 

781 AGCCAGCTGA CCCGCGAGAT TTACACCAAC CCCGTGCTGG AGAACTTCGA CGGCAGCTTC 
SerGlnLeu ThrArgGlu IleTyrThrAsn ProValLeu GluAsnPhe AspGlySerPhe 

841 CGCGGCAGCG CCCAGGGCAT CGAGGGCAGC ATCCGCAGCC CCCACCTGAT GGACATCCTG 
ArgGlySer AlaGlnGly IleGluGlySer IleArgSer ProHisLeu MetAspIleLeu 

901 AACAGCATCA CCATCTACAC CGACGCCCAC CGCGGCGAGT ACTACTGGAG CGGCCACCAG 
AsnSerlle ThrlleTyr ThrAspAlaHis ArgGlyGlu TyrTyrTrp SerGlyHisGln 

961 ATCATGGCCA GCCCCGTCGG CTTCAGCGGC CCCGAGTTCA CCTTCCCCCT GTACGGCACC 
IleMetAla SerProVal GlyPheSerGly ProGluPhe ThrPhePro LeuTyrGlyThr 
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1021 ATGGGCAACG CTGCACCTCA GCAGCGCATC GTGGCACAGC TGGGCCAGGG AGTGTACCGC 
MetGlyAsn AlaAlaPro GlnGlnArglle ValAlaGln LeuGlyGln GlyValTyrArg 

1081 ACCCTGAGCA GCACCCTGTA CCGTCGACCT TTCAACATCG GCATCAACAA CCAGCAGCTG 
ThrLeuSer SerThrLeu TyrArgArgPro PheAsnlle GlylleAsn AsnGlnGlnLeu 

1141 AGCGTGCTGG ACGGCACCGA GTTCGCCTAC GGCACCAGCA GCAACCTGCC CAGCGCCGTG 
SerValLeu AspGlyThr GluPheAlaTyr GlyThrSer SerAsnLeu ProSerAlaVal 

1201 TACCGCAAGA GCGGCACCGT GGACAGCCTG GACGAGATCC CCCCTCAGAA CAACAACGTG 
TyrArgLys SorGlyThr ValAspSerLeu AspGluIle ProProGln AsnAsnAsnVal 

1261 CCACCTCGAC AGGGCTTCAG CCACCGTCTG AGCCACGTGA GCATGTTCCG CAGTGGCTTC 
ProProArg GlnGlyPhe SerHisArgLeu SerHisVal SerMetPhe ArgSerGlyPhe 

1321 AGCAACAGCA GCGTGAGCAT CATCCGTGCA CCTATGTTCA GCTGGATTCA CCGCAGTGCC 
SerAsnSer SerValSer IlelleArgAla ProMetPhe SerTrpIle HisArgSerAla 

1381 GAGTTCAACA ACATCATCCC CAGCAGCCAG ATCACCCAGA TCCCCCTGAC CAAGAGCACC 
GluPheAsn Asnllelle ProSerSerGln IleThrGln IleProLeu ThrLysSerThr 

1441 AACCTGGGCA GCGGCACCAG CGTGGTGAAG GGCCCCGGCT TCACCGGCGG CGACATCCTG 
AsnLeuGly SerGlyThr SerValValLys GlyProGly PheThrGly GlyAspIleLeu 

1501 CGCCGCACCA GCCCCGGCCA GATCAGCACC CTGCGCGTGA ACATCACCGC CCCCCTGAGC 
ArgArgThr SerProGly GlnlleSerThr LeuArgVal AsnlleThr AlaProLeuSer 

1561 CAGCGCTACC GCGTCCGCAT CCGCTACGCC AGCACCACCA ACCTGCAGTT CCACACCAGC 
GlnArgTyr ArgValArg IleArgTyrAla SerThrThr AsnLeuGln PheHisThrSer 

1621 ATCGACGGCC GCCCCATCAA CCAGGGCAAC TTCAGCGCCA CCATGAGCAG CGGCAGCMC 
IleAspGly ArgProIle AsnGlnGlyAsn PheSerAla ThrMetSer SerGlySerAsn 

1681 CTGCAGAGCG GCAGCTTCCG CACCGTGGGC TTCACCACCC CCTTCAACTT CAGCMCGGC 
LeuGlnSer GlySerPhe ArgThrValGly PheThrThr ProPheAsn PheSerAsnGly 

1741 AGCAGCGTGT TCACCCTGAG CGCCCACGTG TTCAACAGCG GCAACGAGGT GTACATCGAC 
SerSerVal PheThrLeu SerAlaHisVal PheAsnSer GlyAsnGlu ValTyrlleAsp 

1801 CGCATCGAGT TCGTGCCCGC CGAGGTGACC TTCGAGGCCG AGTACGACCT GGAGAGGGCT 
ArglleGlu PheValPro AlaGluValThr PheGluAla GluTyrAsp LeuGluArgAla 

1861 CAGAAGGCCG TGAACGAGCT GTTCACCAGC AGCMCCAGA TCGGCCTGAA GACCGACGTG 
GlnLysAla ValAsnGlu LeuPheThrSer SerAsnGln IleGlyLeu LysThrAspVal 

1921 ACCGACTACC ACATCGATCA AGTATCCAAT TTAGTTGAGT GTTTATCTGA TGAATTTTGT 
ThrAspTyr HisIleAsp GlnValSerAsn LeuValGlu CysLeuSer AspGluPheCys 

1981 CTGGATGAAA AAAAAGAATT GTCCGAGAAA GTCAAACATG CGAAGCGACT TAGTGATGAG 
LeuAspGlu LysLysGlu LeuSerGluLys ValLysHis AlaLysArg LeuSerAspGlu 
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2041 CGGAATTTAC TTCAAGATCC AAACTTTAGA GGGATCAATA GACAACTAGA CCGTGGCTGG 
ArgAsnLeu LeuGlnAsp ProAsnPheArg GlylleAsn ArgGlnLeu AspArgGlyTrp 

2101 AGAGGAAGTA CGGATATTAC CATCCAAGGA GGCGATGACG TATTCAAAGA GAATTACGTT 
ArgGlySer ThrAspIle ThrlleGlnGly GlyAspAsp ValPheLys GluAsnTyrVal 

2161 ACGCTATTGG GTACCTTCGA CGAGTGCTAC CCCACCTACC TGTACCAGAA GATCGACGAG 
ThrLeuLeu GlyThrPhe AspGluCysTyr ProThrTyr LeuTyrGln LysIleAspGlu 

2221 AGCAAGCTGA AGGCCTACAC CCGCTACCAG CTGCGCGGCT ACATCGAGGA CAGCCAGGAC 
SerLysLeu LysAlaTyr ThrArgTyrGln LeuArgGly TyrlleGlu AspSerGlnAsp 

2281 CTGGAAATCT ACCTGATCCG CTACAACGCC AAGCACGAGA CCGTGAACGT GCCCGGCACC 
LeuGluIle TyrLeuIle ArgTyrAsnAla LysHisGlu ThrValAsn ValProGlyThr 

2341 GGCAGCCTGT GGCCCCTGAG CGCCCCCAGC CCCATCGGCA AGTGCGGGGA GCCGAATCGA 
GlySerLeu TrpProLeu SerAlaProSer ProIleGly LysCysGly GluProAsnArg 

2401 TGCGCTCCGC ACCTGGAGTG GAACCCGGAC CTAGACTGCA GCTGCAGGGA CGGGGAGAAG 
CysAlaPro HisLeuGlu TrpAsnProAsp LeuAspCys SerCysArg AspGlyGluLys 

TfifGCCCACC ACAGCCACCA CTTCAGCCTG GACATCGACG TGGGCTGCAC CGACCTGAAC 
cJsAlaSs HiSs«His HisPheSerLeu AspIleAsp ValGlyCys ThrAspLeuAsn 

2521 GAGGACCTGG GCGTGTGGGT GATCTTCAAG ATCAAGACCC AGGACGGCCA CGCCCGCCTG 
GluAspLeu GlyValTrp ValllePheLys IleLysThr GlnAspGly HisAlaArgLeu 

2581 GGCAATCTAG AATTTCTCGA AGAGAAACCA TTAGTAGGAG AAGCACTAGC TCGTGTGAAA 
GlyAsnLeu GluPheLeu GluGluLysPro LeuValGly GluAlaLeu AlaArgValLys 

2641 AGAGCGGAGA AAAMTGGAG AGACAAACGT GAAAAATTGG AATGGGAAAC AAATATTGTT 
ArgAlaGlu LysLysTrp ArgAspLysArg GluLysLeu GluTrpGlu ThrAsnlleVal 

2701 TATAAAGAGG CAAAAGAATC TGTAGATGCT TTATTTGTAA ACTCTCAATA TGATAGATTA 
TyrLysGlu AlaLysGlu SerValAspAla LeuPheVal AsnSerGln TyrAspArgLeu 

2761 CAAGCGGATA CCAACATCGC GATGATTCAT GCGGCAGATA AACGCGTTCA TAGCATTCGA 
GlnAlaAsp ThrAsnlle AlaMetlleHis AlaAlaAsp LysArgVal HisSerlleArg 

2821 GAAGCTTATC TGCCTGAGCT GTCTGTGATT CCGGGTGTCA ATGCGGCTAT TTTTGAAGM 
GluAlaTyr LeuProGlu LeuSerVallle ProGlyVal AsnAlaAla IlePheGluGlu 

2881 TTAGAAGGGC GTATTTTCAC TGCATTCTCC CTATATGATG CGAGAAATGT CATTAAAAAT 
LeuGluGly ArgllePhe ThrAlaPheSer LeuTyrAsp AlaArgAsn VallleLysAsn 

2941 GGTGATTTTA ATAATGGCTT ATCCTGCTGG AACGTGAMG GGCATGTAGA TGTAGAAGAA 
GlyAspPhe AsnAsnGly LeuSerCysTrp AsnValLys GlyHisVal AspValGluGlu 

o nni pAAAACAACC ACCGTTCGGT CCTTGTTGTT CCGGAATGGG AAGCAGAAGT GTCACAAGAA 
Gl^snAsn HisArgSer ValLeuValVal ProGluTrp GluAlaGlu ValSerGlnGlu 
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3061 GTTCGTGTCT GTCCGGGTCG TGGCTATATC 
ValArgVal CysProGly ArgGlyTyrlle 

3121 GGAGAAGGTT GCGTAACCAT TCATGAGATC 
GlyGluGly CysValThr IleHisGluIle 

3181 AACTGTGTAG AAGAGGMGT ATATCCAAAC 
AsnCysVal GluGluGlu ValTyrProAsn 

3241 ACTCAAGAAG AATATGAGGG TACGTACACT 
ThrGlnGlu GluTyrGlu GlyThrTyrThr 

3301 GAAAGCAATT CTTCTGTACC AGCTGATTAT 
GluSerAsn SerSerVal ProAlaAspTyr 

3361 GATGGACGAA GAGACAATCC TTGTGAATCT 
AspGlyArg ArgAspAsn ProCysGluSer 

3421 CCAGCTGGCT ATGTGACAAA AGAATTAGAG 
ProAlaGly TyrValThr LysGluLeuGlu 

3481 GAGATCGGAG AAACGGAAGG AACATTCATC 
GluIleGly GluThrGlu GlyThrPhelle 

3541 GAATAA 
Glu— 



CTTCGTGTCA CAGCGTACAA GGAGGGATAT 
LeuArgVal ThrAlaTyr LysGluGlyTyr 

GAGAACAATA CAGACGAACT GMGTTTAGC 
GluAsnAsn ThrAspGlu LeuLysPheSer 

AACACGGTAA CGTGTAATGA TTATACTGCG 
AsnThrVal ThrCysAsn AspTyrThrAla 

TCTCGTAATC GAGGATATGA CGGAGCCTAT 
SerArgAsn ArgGlyTyr AspGlyAlaTyr 

GCATCAGCCT ATGAAGAAAA AGCATATACA 
AlaSerAla TyrGluGlu LysAlaTyrThr 

AACAGAGGAT ATGGGGATTA CACACCACTA 
AsnArgGly TyrGlyAsp TyrThrProLeu 

TACTTCCCAG AAACCGATAA GGTATGGATT 
TyrPhePro GluThrAsp LysValTrpIle 

GTGGACAGCG TGGAATTACT TCTTATGGAG 
ValAspSer ValGluLeu LeuLeuMetGlu 



Fig. 11A 



1 ATGGACAACA ACCCCAACAT CAACGAGTGC 
MetAspAsn AsnProAsn IleAsnGluCys 

61 GTGGAGGTGC TGGGCGGCGA GCGCATCGAG 
ValGluVal LeuGlyGly GluArglleGlu 

121 AGCCTGACCC AGTTCCTGCT GAGCGAGTTC 
SerLeuThr GlnPheLeu LeuSerGluPhe 

181 GTGGACATCA TCTGGGGCAT CTTCGGCCCC 
ValAspIle IleTrpGly IlePheGlyPro 

241 GAGCAGCTGA TCAACCAGCG CATCGAGGAG 
GluGlnLeu IleAsnGln ArglleGluGlu 

301 GAGGGCCTGA GCAACCTGTA CCAAATCTAC 
GluGlyLeu SerAsnLeu TyrGlnlleTyr 

361 CCCACCAACC CCGCCCTGCG CGAGGAGATG 
ProThrAsn ProAlaLeu ArgGluGluMet 

421 CTGACCACCG CCATCCCCCT GTTCGCCGTG 
LeuThrThr AlallePro LeuPheAlaVal 

481 TACGTGCAGG CCGCCAACCT GCACCTGAGC 
TyrValGln AlaAlaAsn LeuHisLeuSer 

541 CGCTGGGGCT TCGACGCCGC CACCATCAAC 
ArgTrpGly PheAspAla AlaThrlleAsn 

601 GGCAACTACA CCGACCACGC CGTGCGCTGG 
GlyAsnTyr ThrAspHis AlaValArgTrp 

661 CCCGACAGCC GCGACTGGAT CAGGTACAAC 
ProAspSer ArgAspTrp IleArgTyrAsn 

721 CTGGACATCG TGAGCCTGTT CCCCAACTAC 
LeuAspIle ValSerLeu PheProAsnTyr 

781 AGCCAGCTGA CCCGCGAGAT TTACACCAAC 
SerGlnLeu ThrArgGlu IleTyrThrAsn 

841 CGCGGCAGCG CCCAGGGCAT CGAGGGCAGC 
ArgGlySer AlaGlnGly IleGluGlySer 

901 AACAGCATCA CCATCTACAC CGACGCCCAC 
AsnSerlle ThrlleTyr ThrAspAlaHis 

961 ATCATGGCCA GCCCCGTCGG CTTCAGCGGC 
IleMetAla SerProVal GlyPheSerGly 



ATCCCCTACA ACTGCCTGAG CAACCCCGAG 
IleProTyr AsnCysLeu SerAsnProGlu 

ACCGGCTACA CCCCCATCGA CATCAGCCTG 
ThrGlyTyr ThrProIle AspIleSerLeu 

GTGCCCGGCG CCGGCTTCGT GCTGGGCCTG 
ValProGly AlaGlyPhe ValLeuGlyLeu 

AGCCAGTGGG ACGCCTTCCT GGTGCAGATC 
SerGlnTrp AspAlaPhe LeuValGlnlle 

TTCGCCCGCA ACCAGGCCAT CAGCCGCCTG 
PheAlaArg AsnGlnAla IleSerArgLeu 

GCCGAGAGCT TCCGCGAGTG GGAGGCCGAC 
AlaGluSer PheArgGlu TrpGluAlaAsp 

CGCATCCAGT TCAACGACAT GAACAGCGCC 
ArglleGln PheAsnAsp MetAsnSerAla 

CAGAACTACC AGGTGCCCCT GCTGAGCGTG 
GlnAsnTyr GlnValPro LeuLeuSerVal 

GTGCTGCGCG ACGTCAGCGT GTTCGGCCAG 
ValLeuArg AspValSer ValPheGlyGln 

AGCCGCTACA ACGACCTGAC CCGCCTGATC 
SerArgTyr AsnAspLeu ThrArgLeuIle 

TACAACACCG GCCTGGAGCG CGTGTGGGGT 
TyrAsnThr GlyLeuGlu ArgValTrpGly 

CAGTTCCGCC GCGAGCTGAC CCTGACCGTG 
GlnPheArg ArgGluLeu ThrLeuThrVal 

GACAGCCGCA CCTACCCCAT CCGCACCGTG 
AspSerArg ThrTyrPro IleArgThrVal 

CCCGTGCTGG AGAACTTCGA CGGCAGCTTC 
ProValLeu GluAsnPhe AspGlySerPhe 

ATCCGCAGCC CCCACCTGAT GGACATCCTG 
IleArgSer ProHisLeu MetAspIleLeu 

CGCGGCGAGT ACTACTGGAG CGGCCACCAG 
ArgGlyGlu TyrTyrTrp SerGlyHisGln 

CCCGAGTTCA CCTTCCCCCT GTACGGCACC 
ProGluPhe ThrPhePro LeuTyrGlyThr 



Fig. 11B 



1021 ATGGGCAACG CTGCACCTCA GCAGCGCATC GTGGCACAGC TGGGCCAGGG AGTGTACCGC 
MetGlyAsn AlaAlaPro GlnGlnArglle ValAlaGln LeuGlyGln GlyValTyrArg 

1081 ACCCTGAGCA GCACCCTGTA CCGTCGACCT TTCA'ACATCG GCATCAACAA CCAGCAGCTG 
ThrLeuSer SerThrLeu TyrArgArgPro PheAsnlle GlylleAsn AsnGlnGlnLeu 

1141 AGCGTGCTGG ACGGCACCGA GTTCGCCTAC GGCACCAGCA GCAACCTGCC CAGCGCCGTG 
SerValLeu AspGlyThr GluPheAlaTyr GlyT.hrSer SerAsnLeu ProSerAlaVal 

1201 TACCGCAAGA GCGGCACCGT GGACAGCCTG GACGAGATCC CCCCTCAGAA CAACAACGTG 
TyrArgLys SerGlyThr ValAspSerLeu AspGluIle ProProGln AsnAsnAsnVal 

1261 CCACCTCGAC AGGGCTTCAG CCACCGTCTG AGCCACGTGA GCATGTTCCG CAGTGGCTTC 
ProProArg GlnGlyPhe SerHisArgLeu SerHisVal SerMetPhe ArgSerGlyPhe 

11? 1 AGCAACAGCA GCGTGAGCAT CATCCGTGCA CCTATGTTCA GCTGGATTCA CCGCAGTGCC 
SerAsnSer SerValSer IlelleArgAla ProMetPhe SerTrpIle HisArgSerAla 

1 ™i GAGTTCAACA ACATCATCCC CAGCAGCCAG ATCACCCAGA TCCCCCTGAC CAAGAGCACC 
1381 Q^p^g^n ^snllelle ProSerSerGln UeThrGln IleProLeu ThrLysSerThr 

1/1/11 **rrTrrrrA nrGGCACCAG CGTGGTGAAG GGCCCCGGCT TCACCGGCGG CGACATCCTG 
1441 A^nLeui? Se?£l$hr Se£vll§ll£?s GlyProGly PheThrGly GlyAspIleLeu 

i Rm rrrraCACCA GCCCCGGCCA GATCAGCACC CTGCGCGTGA ACATCACCGC CCCCCTGAGC 
1 SgA?gTh? SeSroily GlnUeSerThr LeuArgVal AsnlleThr AlaProLeuSer 

T5 61 CAGCGCTACC GCGTCCGCAT CCGCTACGCC AGCACCACCA ACCTGCAGTT CCACACCAGC 
GlnArgTyr ArgValArg IleArgTyrAla SerThrThr AsnLeuGln PheHxsThrSer 

icoi wrrnrrrcr CCCCCATCAA CCAGGGCAAC TTCAGCGCCA CCATGAGCAG CGGCAGCAAC 
1621 ^Q^spGl y Ar gPr o^le As nGlnGl y Asn PheSerAla ThrMetSer SerGlySerAsn 

1fifi1 PTGCAGAGCG GCAGCTTCCG CACCGTGGGC TTCACCACCC CCTTCAACTT CAGCAACGGC 
1 Shinier GlySerPhe ArgThrValGly PheThrThr ProPheAsn PheSerAsnGly 

n/ii nrrnrrrTrT tc&CCCVGAG CGCCCACGTG TTCAACAGCG GCAACGAGGT GTACATCGAC 
1741 sSsitvSl PheThSS A Se?AlSis§Il PheAsnSer GlyAsnGlu ValTyrlleAsp 

1801 CGCATCGAGT TCGTGCCCGC CGAGGTGACC TTCGAGGCCG AGTACGACCT GGAGAGGGCT 
1801 CGCAiCfaAbi p i^ij pro ftlaGluValThr PheG luAla GluTyrAsp LeuGluArgAla 

18 61 CAGAAGGCCG TGAACGAGCT GTTCACCAGC AGCAACCAGA TCGGCCTGAA GACCGACGTG 
GlnLysAla ValAsnGlu LeuPheThrSer SerAsnGln IleGlyLeu LysThrAspVal 

19?1 ACCGACTACC ACATCGATCA GGTGAGCAAC CTGGTGGAGT GCTTAAGCGA CGAGTTCTGC 
$h?AspTyr HisIleAsp GlnValSerAsn LeuValGlu CysLeuSer AspGluPheCys 

1QP1 CTGGACGAGA AGAAGGAGCT GAGCGAGAAG GTGAAGCACG CCAAGCGCCT GAGCGACGAG 
SSSpSu LysLysGlu LeuSerGluLys ValLysHis AlaLysArg LeuSerAspGlu 



Fig. 11C 



2041 CGCAACCTGC TGCAGGACCC CAACTTCCGC GGCATCAACC GCCAGCTGGA CCGCGGCTGG 
ArgAsnLeu LeuGlnAsp ProAsnPheArg GlylleAsn ArgGlnLeu AspArgGlyTrp 

2101 CGAGGCAGCA CCGATATCAC CATCCAGGGC GGCGACGACG TGTTCAAGGA GAACTACGTG 
ArgGlySer ThrAspIle ThrlleGlnGly GlyAspAsp ValPheLys GluAsnTyrVal 

2161 ACCCTGCTGG GCACCTTCGA CGAGTGCTAC CCCACCTACC TGTACCAGAA GATCGACGAG 
ThrLeuLeu GlyThrPhe AspGluCysTyr ProT : hrTyr LeuTyrGln LysIleAspGlu 

2221 AGCAAGCTGA AGGCCTACAC CCGCTACCAG CTGCGCGGCT ACATCGAGGA CAGCCAGGAC 
SerLysLeu LysAlaTyr ThrArgTyrGln LeuArgGly TyrlleGlu AspSerGlnAsp 

$ 2281 CTGGAAATCT ACCTGATCCG CTACAACGCC AAGCACGAGA CCGTGAACGT GCCCGGCACC 

%f LeuGluIle TyrLeuIle ArgTyrAsnAla LysHisGlu ThrValAsn ValProGlyThr 

m 2341 GGCAGCCTGT GGCCCCTGAG CGCCCCCAGC CCCATCGGCA AGTGCGGGGA GCCGAATCGA 

GlySerLeu TrpProLeu SerAlaProSer ProIleGly LysCysGly GluProAsnArg 

4 : 2401 TGCGCTCCGC ACCTGGAGTG GAACCCGGAC CTAGACTGCA GCTGCAGGGA CGGGGAGAAG 

P CysAlaPro HisLeuGlu TrpAsnProAsp LeuAspCys SerCysArg AspGlyGluLys 

2461 TGCGCCCACC ACAGCCACCA CTTCAGCCTG GACATCGACG TGGGCTGCAC CGACCTGAAC 
CysAlaHis HisSerHis HisPheSerLeu AspIleAsp ValGlyCys ThrAspLeuAsn 

2521 GAGGACCTGG GCGTGTGGGT GATCTTCAAG ATCAAGACCC AGGACGGCCA CGCCCGCCTG 
ft! GluAspLeu GlyValTrp ValllePheLys IleLysThr GlnAspGly HisAlaArgLeu 

S 2581 GGCAATCTAG AATTTCTCGA AGAGAAACCA TTAGTAGGAG AAGCACTAGC TCGTGTGAAA 

M GlyAsnLeu GluPheLeu GluGluLysPro LeuValGly GluAlaLeu AlaArgValLys 

2641 AGAGCGGAGA AAAAATGGAG AGACAAACGT GAAAAATTGG AATGGGAAAC AAATATTGTT 
ArgAlaGlu LysLysTrp ArgAspLysArg GluLysLeu GluTrpGlu ThrAsnlleVal 

2701 TATAAAGAGG CAAAAGAATC TGTAGATGCT TTATTTGTAA ACTCTCAATA TGATAGATTA 
TyrLysGlu AlaLysGlu SerValAspAla LeuPheVal AsnSerGln TyrAspArgLeu 

2761 CAAGCGGATA CCAACATCGC GATGATTCAT GCGGCAGATA AACGCGTTCA TAGCATTCGA 
GlnAlaAsp ThrAsnlle AlaMetlleHis AlaAlaAsp LysArgVal HisSerlleArg 

2821 GAAGCTTATC TGCCTGAGCT GTCTGTGATT CCGGGTGTCA ATGCGGCTAT TTTTGAAGAA 
GluAlaTyr LeuProGlu LeuSerVallle ProGlyVal AsnAlaAla IlePheGluGlu 

2881 TTAGAAGGGC GTATTTTCAC TGCATTCTCC CTATATGATG CGAGAAATGT CATTAAAAAT 
LeuGluGly ArgllePhe ThrAlaPheSer LeuTyrAsp AlaArgAsn VallleLysAsn 

2941 GGTGATTTTA ATAATGGCTT ATCCTGCTGG AACGTGAAAG GGCATGTAGA TGTAGAAGAA 
GlyAspPhe AsnAsnGly LeuSerCysTrp AsnValLys GlyHisVal AspValGluGlu 

3001 CAAAACAACC ACCGTTCGGT CCTTGTTGTT CCGGAATGGG AAGCAGAAGT GTCACAAGAA 
GlnAsnAsn HisArgSer ValLeuValVal ProGluTrp GluAlaGlu ValSerGlnGlu 



Fig. 11 D 



3061 GTTCGTGTCT GTCCGGGTCG TGGCTATATC 
ValArgVal CysProGly ArgGlyTyrlle 

3121 GGAGAAGGTT GCGTAACCAT TCATGAGATC 
GlyGluGly CysValThr IleHisGluIle 

3181 AACTGTGTAG AAGAGGAAGT ATATCCAAAC 
AsnCysVal GluGluGlu ValTyrProAsn 

3241 ACTCAAGAAG AATATGAGGG TACGTACACT 
ThrGlnGlu GluTyrGlu GlyThrTyrThr 

3301 GAAAGCAATT CTTCTGTACC AGCTGATTAT 
GluSerAsn SerSerVal ProAlaAspTyr 

3361 GATGGACGAA GAGACAATCC TTGTGAATCT 
AspGlyArg ArgAspAsn ProCysGluSer 

3421 CCAGCTGGCT ATGTGACAAA AGAATTAGAG 
ProAlaGly TyrValThr LysGluLeuGlu 

3481 GAGATCGGAG AAACGGAAGG AACATTCATC 
GluIleGly GluThrGlu GlyThrPhelle 

3541 GAATAA 
Glu— 



CTTCGTGTCA CAGCGTACAA GGAGGGATAT 
LeuArgVal ThrAlaTyr LysGluGlyTyr 

GAGAACAATA CAGACGAACT GAAGTTTAGC 
GluAsnAsn ThrAspGlu LeuLysPheSer 

AACACGGTAA CGTGTAATGA TTATACTGCG 
AsnThrVal ThrCysAsn AspTyrThrAla 

TCTCGTAATC GAGGATATGA CGGAGCCTAT 
SerArgAsn ArgGlyTyr AspGlyAlaTyr 

GCATCAGCCT ATGAAGAAAA AGCATATACA 
AlaSerAla TyrGluGlu LysAlaTyrThr 

AACAGAGGAT ATGGGGATTA CACACCACTA 
AsnArgGly TyrGlyAsp TyrThrProLeu 

TACTTCCCAG AAACCGATAA GGTATGGATT 
TyrPhePro GluThrAsp LysValTrpIle 

GTGGACAGCG TGGAATTACT TCTTATGGAG 
ValAspSer ValGluLeu LeuLeuMetGlu 
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Fig. 13A 



1 ATGGACAACA ACCCCAACAT CAACGAGTGC ATCCCCTACA ACTGCCTGAG CAACCCCGAG 
MetAspAsn AsnProAsn IleAsnGluCys IleProTyr AsnCysLeu SerAsnProGlu 

61 GTGGAGGTGC TGGGCGGCGA GCGCATCGAG ACCGGCTACA CCCCCATCGA CATCAGCCTG 
ValGluVal LeuGlyGly GluArglleGlu ThrGlyTyr ThrProIle AspIleSerLeu 

121 AGCCTGACCC AGTTCCTGCT GAGCGAGTTC GTGCCCGGCG CCGGCTTCGT GCTGGGCCTG 
SerLeuThr GlnPheLeu LeuSerGluPhe ValProGly AlaGlyPhe ValLeuGlyLeu 

181 GTGGACATCA TCTGGGGCAT CTTCGGCCCC AGCCAGTGGG ACGCCTTCCT GGTGCAGATC 
ValAspIle IleTrpGly IlePheGlyPro SerGlnTrp AspAlaPhe LeuValGlnlle 

241 GAGCAGCTGA TCAACCAGCG CATCGAGGAG TTCGCCCGCA ACCAGGCCAT CAGCCGCCTG 
GluGlnLeu IleAsnGln ArglleGluGlu PheAlaArg AsnGlnAla IleSerArgLeu 

301 GAGGGCCTGA GCAACCTGTA CCAAATCTAC GCCGAGAGCT TCCGCGAGTG GGAGGCCGAC 
GluGlyLeu SerAsnLeu TyrGlnlleTyr AlaGluSer PheArgGlu TrpGluAlaAsp 

361 CCCACCAACC CCGCCCTGCG CGAGGAGATG CGCATCCAGT TCAACGACAT GAACAGCGCC 
ProThrAsn ProAlaLeu ArgGluGluMet ArglleGln PheAsnAsp MetAsnSerAla 

421 CTGACCACCG CCATCCCCCT GTTCGCCGTG CAGAACTACC AGGTGCCCCT GCTGAGCGTG 
LeuThrThr AlallePro LeuPheAlaVal GlnAsnTyr GlnValPro LeuLeuSerVal 

481 TACGTGCAGG CCGCCAACCT GCACCTGAGC GTGCTGCGCG ACGTCAGCGT GTTCGGCCAG 
TyrValGln AlaAlaAsn LeuHisLeuSer ValLeuArg AspValSer ValPheGlyGln 

541 CGCTGGGGCT TCGACGCCGC CACCATCAAC AGCCGCTACA ACGACCTGAC CCGCCTGATC 
ArgTrpGly PheAspAla AlaThrlleAsn SerArgTyr AsnAspLeu ThrArgLeuIle 

601 GGCAACTACA CCGACCACGC CGTGCGCTGG TACAACACCG GCCTGGAGCG CGTGTGGGGT 
GlyAsnTyr ThrAspHis AlaValArgTrp TyrAsnThr GlyLeuGlu ArgValTrpGly 

661 CCCGACAGCC GCGACTGGAT CAGGTACAAC CAGTTCCGCC GCGAGCTGAC CCTGACCGTG 
ProAspSer ArgAspTrp IleArgTyrAsn GlnPheArg ArgGluLeu ThrLeuThrVal 

721 CTGGACATCG TGAGCCTGTT CCCCAACTAC GACAGCCGCA CCTACCCCAT CCGCACCGTG 
LeuAspIle ValSerLeu PheProAsnTyr AspSerArg ThrTyrPro IleArgThrVal 

781 AGCCAGCTGA CCCGCGAGAT TTACACCAAC CCCGTGCTGG AGAACTTCGA CGGCAGCTTC 
SerGlnLeu ThrArgGlu IleTyrThrAsn ProValLeu GluAsnPhe AspGlySerPhe 

841 CGCGGCAGCG CCCAGGGCAT CGAGGGCAGC ATCCGCAGCC CCCACCTGAT GGACATCCTG 
ArgGlySer AlaGlnGly IleGluGlySer IleArgSer ProHisLeu MetAspIleLeu 



901 



AACAGCATCA CCATCTACAC CGACGCCCAC 
AsnSerlle ThrlleTyr ThrAspAlaHis 



CGCGGCGAGT ACTACTGGAG CGGCCACCAG 
ArgGlyGlu TyrTyrTrp SerGlyHisGln 



961 



ATCATGGCCA GCCCCGTCGG CTTCAGCGGC 
IleMetAla SerProVal GlyPheSerGly 



CCCGAGTTCA CCTTCCCCCT GTACGGCACC 
ProGluPhe ThrPhePro LeuTyrGlyThr 



Fig. 13B 



1021 ATGGGCAACG CTGCACCTCA GCAGCGCATC GTGGCACAGC TGGGCCAGGG AGTGTACCGC 
MotGlyAan AlaAlaPro GlnGlnArgllc ValAlaGln LeuGlyGln GlyValTyrArg 

1081 ACCCTGAGCA GCACCCTGTA CCGTCGACCT TTCAACATCG GCATCAACAA CCAGCAGCTG 
ThrLeuSer SerThrLeu TyrArgArgPro PheAsnlle GlylleAsn AsnGlnGlnLeu 

1141 AGCGTGCTGG ACGGCACCGA GTTCGCCTAC GGCACCAGCA GCAACCTGCC CAGCGCCGTG 
SerValLeu AspGlyThr GluPheAlaTyr GlyThrSer SerAsnLeu ProSerAlaVal 

1201 TACCGCAAGA GCGGCACCGT GGACAGCCTG GACGAGATCC CCCCTCAGAA CAACAACGTG 
TyrArgLys SerGlyThr ValAspSerLeu AspGluIle ProProGln AsnAsnAsnVal 

1261 CCACCTCGAC AGGGCTTCAG CCACCGTCTG AGCCACGTGA GCATGTTCCG CAGTGGCTTC 
ProProArg GlnGlyPhe SerHisArgLeu SerHisVal SerMetPhe ArgSerGlyPhe 

1321 AGCAACAGCA GCGTGAGCAT CATCCGTGCA CCTATGTTCA GCTGGATTCA CCGCAGTGCC 
SerAsnSer SerValSer IlelleArgAla ProMetPhe SerTrpIle HisArgSerAla 

HR1 TAGTTCAACA ACATCATCCC CAGCAGCCAG ATCACCCAGA TCCCCCTGAC CAAGAGCACC 
1381 Qjupj^Q^gn Asnllelle ProSerSerGln IleThrGln IleProLeu ThrLysSerThr 

1/1/11 mrcrrrrrA GrGGCACCAG CGTGGTGAAG GGCCCCGGCT TCACCGGCGG CGACATCCTG 
1441 A^nSSIly Serily^h? Ier§alValLys GlyProGly PheThrGly GlyAspIleLeu 

mm r rrraCACC A GCCCCGGCCA GATCAGCACC CTGCGCGTGA ACATCACCGC CCCCCTGAGC 
SgArgTh? SeSroGly GlnUeSerThr LeuArgVal AsnlleThr AlaProLeuSer 

rACCCCTACC GCGTCCGCAT ccgctacgcc agcaccacca acctgcagtt ccacaccagc 

156 SnArgS? AigvIlArg IleAigTyrAla SerThrThr AsnLeuGln PheHxsThrSer 

ATCGACGGCC GCCCCATCAA CCAGGGCAAC TTCAGCGCCA CCATGAGCAG CGGCAGCAAC 
1621 ^ggCg^ Phe SerAla ThrMetSer SerGlySerAsn 

ifiRi rTrmrACCG GCAGCTTCCG CACCGTGGGC TTCACCACCC CCTTCAACTT CAGCAACGGC 
168 Sinter GlysSrPhe Arg?hrValGly PheThrThr ProPheAsn PheSerAsnGly 

1741 ACCAGCGTGT TCACCCTGAG CGCCCACGTG TTCAACAGCG GCAACGAGGT GTACATCGAC 
sSsSvIl PheThrLeu SerAlaHisVal PheAsnSer GlyAsnGlu ValTyrlleAsp 

ioni rrranrrarT TCGTGCCCGC CGAGGTGACC TTCGAGGCCG AGTACGACCT GGAGAGGGCT 
1801 CGCATCGAGT^ PheGluAla GluTyrAsp LeuGluArgAla 

18 61 CAGAAGGCCG TGAACGAGCT GTTCACCAGC AGCAACCAGA TCGGCCTGAA GACCGACGTG 
GlnLysAla ValAsnGlu LeuPheThrSer SerAsnGln IleGlyLeu LysThrAspvai 

1091 zrrrACTACC ACATCGATCA GGTGAGCAAC CTGGTGGAGT GCTTAAGCGA CGAGTTCTGC 
192 Th5Sp??S HiSleSp SlnValSerAsn LeuValGlu CysLeuSer AspGluPheCys 

1981 CTGGACGAGA AGAAGGAGCT GAGCGAGAAG GTGAAGCACG CCAAGCGCCT GAGCGACGAG 
LeuAspGlu LysLysGlu LeuSerGluLys ValLysHis AlaLysArg LeuSerAspGlu 



Fig. 13C 



2041 CGCAACCTGC TGCAGGACCC CAACTTCCGC 
ArgAsnLeu LeuGlnAsp ProAsnPheArg 

2101 CGAGGCAGCA CCGATATCAC CATCCAGGGC 
ArgGlySer ThrAspIle ThrlleGlnGly 

2161 ACCCTGCTGG GCACCTTCGA CGAGTGCTAC 
ThrLeuLeu GlyThrPhe AspGluCysTyr 

2221 AGCAAGCTGA AGGCCTACAC CCGCTACCAG 
SerLysLeu LysAlaTyr ThrArgTyrGln 

2281 CTGGAAATCT ACCTGATCCG CTACAACGCC 
LeuGluIle TyrLeuIle ArgTyrAsnAla 

2341 GGCAGCCTGT GGCCCCTGAG CGCCCCCAGC 
GlySerLeu TrpProLeu SerAlaProSer 

2401 TGCGCTCCGC ACCTGGAGTG GAACCCGGAC 
CysAlaPro HisLeuGlu TrpAsnProAsp 

24 61 TGCGCCCACC ACAGCCACCA CTTCAGCCTG 
CysAlaHis HisSerHis HisPheSerLeu 

2521 GAGGACCTGG GCGTGTGGGT GATCTTCAAG 
GluAspLeu GlyValTrp ValllePheLys 

2581 GGCAATCTAG AGTTCCTGGA GGAGAAGCCC 
GlyAsnLeu GluPheLeu GluGluLysPro 

2641 CGCGCCGAGA AGAAGTGGCG CGACAAGCGC 
ArgAlaGlu LysLysTrp ArgAspLysArg 

2701 TACAAGGAGG CCAAGGAGAG CGTGGACGCC 
TyrLysGlu AlaLysGlu SerValAspAla 

2761 CAGGCCGACA CCAACATCGC CATGATCCAC 
GlnAlaAsp ThrAsnlle AlaMetlleHis 

2821 GAGGCCTACC TGCCCGAGCT GAGCGTGATC 
GluAlaTyr LeuProGlu LeuSerVallle 

2881 CTCGAGGGCC GCATCTTCAC CGCCTTCAGC 
LeuGluGly ArgllePhe ThrAlaPheSer 

2941 GGCGACTTCA ACAACGGCCT GAGCTGCTGG 
GlyAspPhe AsnAsnGly LeuSerCysTrp 

3001 CAGAACAACC ACCGCAGCGT GCTGGTGGTG 
GlnAsnAsn HisArgSer ValLeuValVal 



GGCATCAACC GCCAGCTGGA CCGCGGCTGG 
GlylleAsn ArgGlnLeu AspArgGlyTrp 

GGCGACGACG TGTTCAAGGA GAACTACGTG 
GlyAspAsp ValPheLys GluAsnTyrVal 

CCCACCTACC TGTACCAGAA GATCGACGAG 
ProThrTyr LeuTyrGln LysIleAspGlu 

CTGCGCGGCT ACATCGAGGA CAGCCAGGAC 
LeuArgGly TyrlleGlu AspSerGlnAsp 

AAGCACGAGA CCGTGAACGT GCCCGGCACC 
LysHisGlu ThrValAsn ValProGlyThr 

CCCATCGGCA AGTGCGGGGA GCCGAATCGA 
ProIleGly LysCysGly GluProAsnArg 

CTAGACTGCA GCTGCAGGGA CGGGGAGAAG 
LeuAspCys SerCysArg AspGlyGluLys 

GACATCGACG TGGGCTGCAC CGACCTGAAC 
AspIleAsp ValGlyCys ThrAspLeuAsn 

ATCAAGACCC AGGACGGCCA CGCCCGCCTG 
IleLysThr GlnAspGly HisAlaArgLeu 

CTGGTGGGCG AGGCCCTGGC CCGCGTGAAG 
LeuValGly GluAlaLeu AlaArgValLys 

GAGAAGCTGG AGTGGGAGAC CAACATCGTG 
GluLysLeu GluTrpGlu ThrAsnlleVal 

CTGTTCGTGA ACAGCCAGTA CGACCGCCTG 
LeuPheVal AsnSerGln TyrAspArgLeu 

GCCGCCGACA AGCGCGTGCA CAGCATTCGC 
AlaAlaAsp LysArgVal HisSerlleArg 

CCCGGCGTGA ACGCCGCCAT CTTCGAGGAA 
ProGlyVal AsnAlaAla IlePheGluGlu 

CTGTACGACG CCCGCAACGT GATCAAGAAC 
LeuTyrAsp AlaArgAsn VallleLysAsn 

AACGTGAAGG GCCACGTGGA CGTGGAGGAG 
AsnValLys GlyHisVal AspValGluGlu 

CCCGAGTGGG AGGCCGAGGT GAGCCAGGAG 
ProGluTrp GluAlaGlu ValSerGlnGlu 



Fig. 13D 



3061 GTGCGCGTGT GCCCCGGCCG CGGCTACATC CTGCGCGTGA CCGCCTACAA GGAGGGCTAC 
ValArgVal CysProGly ArgGlyTyrlle LeuArgVal ThrAlaTyr LysGluGlyTyr 

3121 GGCGAGGGCT GCGTGACCAT CCACGAGATC GAGAACAACA CCGACGAGCT CAAGTTCAGC 
GlyGluGly CysValThr IleHisGluIle GluAsnAsn ThrAspGlu LeuLysPheSer 

3181 AACTGCGTGG AGGAGGAGGT GTACCCCAAC AACACCGTGA CCTGCAACGA CTACACCGCG 
AsnCysVal GluGluGlu ValTyrProAsn AsnThrVal ThrCysAsn AspTyrThrAla 

3241 ACCCAGGAGG AGTACGAGGG CACCTACACC AGCCGCAACC GCGGCTACGA CGGCGCCTAC 
ThrGlnGlu GluTyrGlu GlyThrTyrThr SerArgAsn ArgGlyTyr AspGlyAlaTyr 

3301 GAGAGCAACA GCAGCGTGCC CGCCGACTAC GCCAGCGCCT ACGAGGAGAA GGCCTACACC 
GluSerAsn SerSerVal ProAlaAspTyr AlaSerAla TyrGluGlu LysAlaTyrThr 

3361 GACGGCCGCC GCGACAACCC CTGCGAGAGC AACCGCGGCT ACGGCGACTA CACCCCCCTG 
AspGlyArg ArgAspAsn ProCysGluSer AsnArgGly TyrGlyAsp TyrThrProLeu 

3421 CCCGCCGGCT ACGTGACCAA GGAGCTGGAG TACTTCCCCG AGACCGACAA GGTGTGGATC 
ProAlaGly TyrValThr LysGluLeuGlu TyrPhePro GluThrAsp LysValTrpIle 

3481 GAGATCGGCG AGACCGAGGG CACCTTCATC GTGGACAGCG TGGAGCTGCT GCTGATGGAG 
GluIleGly GluThrGlu GlyThrPhelle ValAspSer ValGluLeu LeuLeuMetGlu 

3541 GAG 
Glu 



Fig. 15 A 



I ATGGACAACA ACCCCAACAT CAACGAGTGC 
MetAspAsn AsnProAsn IleAsnGluCys 

61 GTGGAGGTGC TGGGCGGCGA GCGCATCGAG 
ValGluVal LeuGlyGly GluArglleGlu 

121 AGCCTGACCC AGTTCCTGCT GAGCGAGTTC 
SerLeuThr GlnPheLeu LeuSerGluPhe 

181 GTGGACATCA TCTGGGGCAT CTTCGGCCCC 
ValAspIle IleTrpGly IlePheGlyPro 

241 GAGCAGCTGA TCMCCAGCG CATCGAGGAG 
GluGlnLeu IleAsnGln ArglleGluGlu 

301 GAGGGCCTGA GCAACCTGTA CCAAATCTAC 
GluGlyLeu SerAsnLeu TyrGlnlleTyr 

361 CCCACCAACC CCGCCCTGCG CGAGGAGATG 
ProThrAsn ProAlaLeu ArgGluGluMet 

421 CTGACCACCG CCATCCCCCT GTTCGCCGTG 
LeuThrThr AlallePro LeuPheAlaVal 

481 TACGTGCAGG CCGCCAACCT GCACCTGAGC 
TyrValGln AlaAlaAsn LeuHisLeuSer 

541 CGCTGGGGCT TCGACGCCGC CACCATCAAC 
ArgTrpGly PheAspAla AlaThrlleAsn 

601 GGCAACTACA CCGACCACGC CGTGCGCTGG 
GlyAsnTyr ThrAspHis AlaValArgTrp 

661 CCCGACAGCC GCGACTGGAT CAGGTACAAC 
ProAspSer ArgAspTrp IleArgTyrAsn 

721 CTGGACATCG TGAGCCTGTT CCCCAACTAC 
LeuAspIle ValSerLeu PheProAsnTyr 

781 AGCCAGCTGA CCCGCGAGAT TTACACCAAC 
SerGlnLeu ThrArgGlu IleTyrThrAsn 

841 CGCGGCAGCG CCCAGGGCAT CGAGGGCAGC 
ArgGlySer AlaGlnGly IleGluGlySer 

901 AACAGCATCA CCATCTACAC CGACGCCCAC 
AsnSerlle ThrlleTyr ThrAspAlaHis 

961 ATCATGGCCA GCCCCGTCGG CTTCAGCGGC 
IleMetAla SerProVal GlyPheSerGly 



ATCCCCTACA ACTGCCTGAG CAACCCCGAG 
IleProTyr AsnCysLeu SerAsnProGlu 

ACCGGCTACA CCCCCATCGA CATCAGCCTG 
ThrGlyTyr ThrProIle AspIleSerLeu 

GTGCCCGGCG CCGGCTTCGT GCTGGGCCTG 
ValProGly AlaGlyPhe ValLeuGlyLeu 

AGCCAGTGGG ACGCCTTCCT GGTGCAGATC 
SerGlnTrp AspAlaPhe LeuValGlnlle 

TTCGCCCGCA ACCAGGCCAT CAGCCGCCTG 
PheAlaArg AsnGlnAla IleSerArgLeu 

GCCGAGAGCT TCCGCGAGTG GGAGGCCGAC 
AlaGluSer PheArgGlu TrpGluAlaAsp 

CGCATCCAGT TCAACGACAT GAACAGCGCC 
ArglleGln PheAsnAsp MetAsnSerAla 

CAGAACTACC AGGTGCCCCT GCTGAGCGTG 
GlnAsnTyr GlnValPro LeuLeuSerVal 

GTGCTGCGCG ACGTCAGCGT GTTCGGCCAG 
ValLeuArg AspValSer ValPheGlyGln 

AGCCGCTACA ACGACCTGAC CCGCCTGATC 
SerArgTyr AsnAspLeu ThrArgLeuIle 

TACAACACCG GCCTGGAGCG CGTGTGGGGT 
TyrAsnThr GlyLeuGlu ArgValTrpGly 

CAGTTCCGCC GCGAGCTGAC CCTGACCGTG 
GlnPheArg ArgGluLeu ThrLeuThrVal 

GACAGCCGCA CCTACCCCAT CCGCACCGTG 
AspSerArg ThrTyrPro IleArgThrVal 

CCCGTGCTGG AGAACTTCGA CGGCAGCTTC 
ProValLeu GluAsnPhe AspGlySerPhe 

ATCCGCAGCC CCCACCTGAT GGACATCCTG 
IleArgSer ProHisLeu MetAspIleLeu 

CGCGGCGAGT ACTACTGGAG CGGCCACCAG 
ArgGlyGlu TyrTyrTrp SerGlyHisGln 

CCCGAGTTCA CCTTCCCCCT GTACGGCACC 
ProGluPhe ThrPhePro LeuTyrGlyThr 



Fig. 15B 



1021 ATGGGCAACG CTGCACCTCA GCAGCGCATC 
MetGlyAsn AlaAlaPro GlnGlnArglle 

1081 ACCCTGAGCA GCACCCTGTA CCGTCGACCT 
ThrLeuSer SerThrLeu TyrArgArgPro 

1141 AGCGTGCTGG ACGGCACCGA GTTCGCCTAC 
SerValLeu AspGlyThr GluPheAlaTyr 

1201 TACCGCAAGA GCGGCACCGT GGACAGCCTG 
TyrArgLys SerGlyThr ValAspSerLeu 

1261 CCACCTCGAC AGGGCTTCAG CCACCGTCTG 
ProProArg GlnGlyPhe SerHisArgLeu 

1321 AGCAACAGCA GCGTGAGCAT CATCCGTGCA 
SerAsnSer, SerValSer IlelleArgAla 

1381 GAGTTCAACA ACATCATCCC CAGCAGCCAG 
GluPheAsn Asnllelle ProSerSerGln 

1441 AACCTGGGCA GCGGCACCAG CGTGGTGAAG 
AsnLeuGly SerGlyThr SerValValLys 

1501 CGCCGCACCA GCCCCGGCCA GATCAGCACC 
ArgArgThr SerProGly GlnlleSerThr 

1561 CAGCGCTACC GCGTCCGCAT CCGCTACGCC 
GlnArgTyr ArgValArg IleArgTyrAla 

1621 ATCGACGGCC GCCCCATCAA CCAGGG^AAC 
IleAspGly ArgProIle AsnGlnGlyAsn 

1681 CTGCAGAGCG GCAGCTTCCG CACCGTGGGC 
LeuGlnSer GlySerPhe ArgThrValGly 

1741 AGCAGCGTGT TCACCCTGAG CGCCCACGTG 
SerSerVal PheThrLeu SerAlaHisVal 

1801 CGCATCGAGT TCGTGCCCGC CGAGGTGACC 
ArglleGlu PheValPro AlaGluValThr 

1861 CAGAAGGCCG TGAACGAGCT GTTCACCAGC 
GlnLysAla ValAsnGlu LeuPheThrSer 

1921 ACCGACTACC ACATCGATCA AGTATCCAAT 
ThrAspTyr HisIleAsp GlnValSerAsn 

1981 CTGGATGAAA AAAAAGAATT GTCCGAGAAA 
LeuAspGlu LysLysGlu LeuSerGluLys 



GTGGCACAGC TGGGCCAGGG AGTGTACCGC 
ValAlaGln LeuGlyGln GlyValTyrArg 

TTCAACATCG GCATCAACAA CCAGCAGCTG 
PheAsnlle GlylleAsn AsnGlnGlnLeu 

GGCACCAGCA GCAACCTGCC CAGCGCCGTG 
GlyThrSer SerAsnLeu ProSerAlaVal 

GACGAGATCC CCCCTCAGAA CAACAACGTG 
AspGluIle ProProGln AsnAsnAsnVal 

AGCCACGTGA GCATGTTCCG CAGTGGCTTC 
SerHisVal SerMetPhe ArgSerGlyPhe 

CCTATGTTCA GCTGGATTCA CCGCAGTGCC 
ProMetPhe SerTrpIle HisArgSerAla 

ATCACCCAGA TCCCCCTGAC CAAGAGCACC 
IleThrGln IleProLeu ThrLysSerThr 

GGCCCCGGCT TCACCGGCGG CGACATCCTG 
GlyProGly PheThrGly GlyAspIleLeu 

CTGCGCGTGA ACATCACCGC CCCCCTGAGC 
LeuArgVal AsnlleThr AlaProLeuSer 

AGCACCACCA ACCTGCAGTT CCACACCAGC 
SerThrThr AsnLeuGln PheHisThrSer 

TTCAGCGCCA CCATGAGCAG CGGCAGCAAC 
PheSerAla ThrMetSer SerGlySerAsn 

TTCACCACCC CCTTCAACTT CAGCAACGGC 
PheThrThr ProPheAsn PheSerAsnGly 

TTCAACAGCG GCAACGAGGT GTACATCGAC 
PheAsnSer GlyAsnGlu ValTyrlleAsp 

TTCGAGGCCG AGTACGACCT GGAGAGGGCT 
PheGluAla GluTyrAsp LeuGluArgAla 

AGCAACCAGA TCGGCCTGAA GACCGACGTG 
SerAsnGln IleGlyLeu LysThrAspVal 

TTAGTTGAGT GTTTATCTGA TGAATTTTGT 
LeuValGlu CysLeuSer AspGluPheCys 

GTCAAACATG CGAAGCGACT TAGTGATGAG 
ValLysHis AlaLysArg LeuSerAspGlu 



Fig. 15C 



2041 CGGAATTTAC TTCAAGATCC AAACTTTAGA 
ArgAsnLeu LeuGlnAsp ProAsnPheArg 

2101 AGAGGAAGTA CGGATATTAC CATCCAAGGA 
ArgGlySer ThrAspIle ThrlleGlnGly 

2161 ACGCTATTGG GTACCTTTGA TGAGTGCTAT 
ThrLeuLeu GlyThrPhe AspGluCysTyr 

2221 TCGAAATTAA AAGCCTATAC CCGTTACCAA 
SerLysLeu LysAlaTyr ThrArgTyrGln 

2281 TTAGAAATCT ATTTAATTCG CTACAATGCC 
LeuGluIle TyrLeuIle ArgTyrAsnAla 

2 341 GGTTCCTTAT GGCCGCTTTC AGCCCCAAGT 
GlySerLeu TrpProLeu SerAlaProSer 

2401 TGCGCTCCGC ACCTGGAGTG GAACCCGGAC 
CysAlaPro HisLeuGlu TrpAsnProAsp 

24 61 TGCGCCCACC ACAGCCACCA CTTCAGCCTG 
CysAlaliis HisSerHis HisPheSerLeu 

2521 GAGGACCTGG GCGTGTGGGT GATCTTCAAG 
GluAspLeu GlyValTrp ValllePheLys 

2581 GGCAATCTAG AATTTCTCGA AGAGAAACCA 
GlyAsnLeu GluPheLeu GluGluLysPro 

2641 AGAGCGGAGA AAAAATGGAG AGACAAACGT 
ArgAlaGlu LysLysTrp ArgAspLysArg 

2701 TATAAAGAGG CAAAAGAATC TGTAGATGCT 
TyrLysGlu AlaLysGlu SerValAspAla 

2761 CAAGCGGATA CCAACATCGC GATGATTCAT 
GlnAlaAsp ThrAsnlle AlaMetlleHis 

2821 GAAGCTTATC TGCCTGAGCT GTCTGTGATT 
GluAlaTyr LeuProGlu LeuSerVallle 

2881 TTAGAAGGGC GTATTTTCAC TGCATTCTCC 
LeuGluGly ArgllePhe ThrAlaPheSer 

2 941 GGTGATTTTA ATAATGGCTT ATCCTGCTGG 
GlyAspPhe AsnAsnGly LeuSerCysTrp 

3001 CAAAACAACC ACCGTTCGGT CCTTGTTGTT 
GlnAsnAsn HisArgSer ValLeuValVal 



GGGATCAATA GACAACTAGA CCGTGGCTGG 
GlylleAsn ArgGlnLeu AspArgGlyTrp 

GGCGATGACG TATTCAAAGA GAATTACGTT 
GlyAspAsp ValPheLys GluAsnTyrVal 

CCAACGTATT TATATCAAAA AATAGATGAG 
ProThrTyr LeuTyrGln LysIleAspGlu 

TTAAGAGGGT ATATCGAAGA TAGTCAAGAC 
LeuArgGly TyrlleGlu AspSerGlnAsp 

AAACACGAAA CAGTAAATGT GCCAGGTACG 
LysHisGlu ThrValAsn ValProGlyThr 

CCAATCGGCA AGTGCGGGGA GCCGAATCGA 
ProIleGly LysCysGly GluProAsnArg 

CTAGACTGCA GCTGCAGGGA CGGGGAGAAG 
LeuAspCys SerCysArg AspGlyGluLys 

GACATCGACG TGGGCTGCAC CGACCTGAAC 
AspIleAsp ValGlyCys ThrAspLeuAsn 

ATCAAGACCC AGGACGGCCA CGCCCGCCTG 
IleLysThr GlnAspGly HisAlaArgLeu 

TTAGTAGGAG AAGCACTAGC TCGTGTGAAA 
LeuValGly GluAlaLeu AlaArgValLys 

GAAAAATTGG AATGGGAAAC AAATATTGTT 
GluLysLeu GluTrpGlu ThrAsnlleVal 

TTATTTGTAA ACTCTCAATA TGATAGATTA 
LeuPheVal AsnSerGln TyrAspArgLeu 

GCGGCAGATA AACGCGTTCA TAGCATTCGA 
AlaAlaAsp LysArgVal HisSerlleArg 

CCGGGTGTCA ATGCGGCTAT TTTTGAAGAA 
ProGlyVal AsnAlaAla IlePheGluGlu 

CTATATGATG CGAGAAATGT CATTAAAAAT 
LeuTyrAsp AlaArgAsn VallleLysAsn 

AACGTGAAAG GGCATGTAGA TGTAGAAGAA 
AsnValLys GlyHisVal AspValGluGlu 

CCGGAATGGG AAGCAGAAGT GTCACAAGAA 
ProGluTrp GluAlaGlu ValSerGlnGlu 



Fig. 15D 



3061 GTTCGTGTCT GTCCGGGTCG TGGCTATATC CTTCGTGTCA CAGCGTACAA GGAGGGATAT 
ValArgVal CysProGly ArgGlyTyrlle LeuArgVal ThrAlaTyr LysGluGlyTyr 

3121 GGAGAAGGTT GCGTAACCAT TCATGAGATC GAGAACAATA CAGACGAACT GAAGTTTAGC 
GlyGluGly CysValThr IleHisGluIle GluAsnAsn ThrAspGlu LeuLysPheSer 

3181 AACTGTGTAG AAGAGGAAGT ATATCCAAAC AACACGGTAA CGTGTMTGA TTATACTGCG 
AsnCysVal GluGluGlu ValTyrProAsn AsnThrVal ThrCysAsn AspTyrThrAla 

3241 ACTCAAGAAG AATATGAGGG TACGTACACT TCTCGTAATC GAGGATATGA CGGAGCCTAT 
ThrGlnGlu GluTyrGlu GlyThrTyrThr SerArgAsn ArgGlyTyr AspGlyAlaTyr 

3301 GAAAGCAATT CTTCTGTACC AGCTGATTAT GCATCAGCCT ATGAAGAAAA AGCATATACA 
GluSerAsn SerSerVal ProAlaAspTyr AlaSerAla TyrGluGlu LysAlaTyrThr 

3361 GATGGACGAA GAGACAATCC TTGTGAATCT AACAGAGGAT ATGGGGATTA CACACCACTA 
AspGlyArg ArgAspAsn ProCysGluSer AsnArgGly TyrGlyAsp TyrThrProLeu 

3421 CCAGCTGGCT ATGTGACAAA AGAATTAGAG TACTTCCCAG AAACCGATAA GGTATGGATT 
ProAlaGly TyrValThr LysGluLeuGlu TyrPhePro GluThrAsp LysValTrpIle 

3481 GAGATCGGAG AAACGGAAGG AACATTCATC GTGGACAGCG TGGAATTACT TCTTATGGAG 
GluIleGly GluThrGlu GlyThrPhelle ValAspSer ValGluLeu LeuLeuMetGlu 

3541 GAATAAG 
Glu 



Fig. 23A 

CrvlA(b) Protein Levels in Transgenic Maize 
ELISA Bt Values of Field Plants: 



INBRED X PARENT 


ABRU PLANT Number 


ng Bt/mg protein 


2ND01X171-4A 


1646 


29 


5N984X171-4A 


857 


1705 


RNQfi4X171»4A 


870 


1760 

1 f \J\J 


SNQR4X171-13 

\JINv7w*T/\ III 1 w 


969 


22 


RNQft4X171-1 R 


14R8 


17 


£MQP4Y171 






5N984X171-14A 


1502 


180 


5N984X171-14A 


1529 


1500 


5N984X1 76-11 


1667 


408 


5N984X1 76-11 


1671 


1270 


5N984X1 76-11 


1673 


1522 


5N984X1 76-11 


1675 


943 


5N984X1 76-11 


1679 


967 


5N984X171-4B 


1942 


15 


5N984X171-4B 


1946 


16 


5NA56X171-16ABX 


1101 


30 


5NA89X1 76-11 


1622 


959 


5NA89X1 76-11 


1630 


1172 


5NA89X1 76-11 


1635 


1100 


6F010X171-4 


825 


103 


6F010X171-4 


832 


1298 



-Bt levels are in ng crylA(b)/mg total protein. 

-Data are from progeny of the described maize transformants expressing the crylA(b) 
protein. 

-ELISA analysis of transgenic plant material was carried out using standard procedures 
as described elsewhere. 
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Fig. 23C 



CrvlA(b) Protein Levels in Transgenic Maize 



Greenhouse plants 

35S LINE 

6F010 x 171-4A 
5N984 x 171-1 4A 
6F010X171-16AB 
5N984X 171-13 
5NA89X 171-13 
5N984X 171-18 
6N615X 171-16AB 

PEPC LINE 

6N615 x 176-11 
6F010X 176-10 
5N984X 176-11 



LEAF 

-409 + 288 
256 + 159 
240 + 174 
201 + 94 
37 + 7 
7.7 + 3 
7.5 + 3 



1126 + 419 
774 + 159 
719 + 128 



PITH 

NT 
191 
221 
NT 
150 
NT 
0 



41 

NT 

16 



ROOT 

NT 
198 
271 
NT 
0 

NT 
0 



19 

NT 

20 



POLLEN 

NT 

30 

NT 

NT 

NT 

NT 



NT 
130 
186 



-Bt levels are in ng crylA(b)/mg total protein. 

Data are from progeny of the described maize transformants expressing the crylA(b) 
protein. 

ELISA analysis of transgenic plant material was carried out using standard procedures 
as described elsewhere. 
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EXPRESSION OF THE CRYIA(b) GENE IN TRANSGENIC MAIZE USING THE PITH- 
PREFERRED PROMOTER 

Leaf samples from small plantlets transformed with pCIB4433 using procedures 
described elsewhere were analyzed for the presence of the crylA(b) protein using 
ELISA. All plants expressing crylA(b) were found to be insecticidal in the standard 
European corn borer bioassay. 

Note that the pith-preferred promoter has a low, but detectable level of expression in 
leaf tissue of maize. Detection of CrylA(b) protein is consistent with this pattern of 
expression. 



PLANT NUMBER ng crylA(b)/mg protein 



JS21A-1 TOP 
JS21A-2 TOP 
JS21A-3 TOP 
JS21A-11 TOP 
JS21A-12TOP 
JS21A-13TOP 
JS21A-14TOP 
JS21A-19TOP 
JS21A-24 TOP 
JS21A-28 TOP 
JS22D-3 MID 
JS22D-4 MID 
JS22D-11 MID 
JS22D-17 MID 



169 
0 

113 
127 
112 
97 
118 
82 
0 

154 
2946 
5590 

215 
3004 
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1 GAATTCGGATCCATTAMGMGTCTTTGMCAGATTCTAGAGATCTAGTTTAATGAGCTC 60 

61 CCAAAAGTCTTGAAAAAATTCAGCGGGGAGGCCATTAGGGCAGGGGTACTGTTATGTTTT 120 

121 AMGAGAACACCACTTTCTTGATCTCTTCTAAAGAGAMTGTTTTGTAAGAAGGATCCTG 180 

181 TCCTCCTCATCCAACCTTTTCATCGGCAAATTTTTCATAGAGATATTAGAGGCAAGAGAG 240 
241 GGGCCAAAAAGATCCATGTAAATGGAAGTGGCCACCTGGTTGATACCTCCCTCATCTTCA 300 

301 ACAG MAATCCATTATGAAAAAGTGAATGGATTTTAAACTCTTCTTTTTCTTCCCTTTTG 360 

361 CAATGAGCTGAAAATATCTGGTATTATTCTCATCACCCTCATTAATGAATCTGTCCCTAG 420 

421 CAATTTGCTTTCTCTTGATCCCTTCTGCAGCCACCATGTTTCTTAAATTCCACTCCATAT 480 

481 CAAGCTTTTCCAATCTATCAGAATCTGAGATGGCTGCAATCTCTCTCATTTTCTCAAGGA 540 

541 TATCGATGTTATCCATAAGGTATTTCTTGAACTTCTTATATTTCCCTTCGACATTTATAT 600 

601 TCCATCCTTTCAACATTTTTTTGTTCAATCTTTTTTGTTTTTTTCCTTTCCAAACATCGA 660 

661 TACATTTCCTGCTCCTCACAGGTAAGGACGAGCTTTCAAAAAACCTTCTGCTTTAAAGTC 720 

721 AGGTCTGAGCCTCCAGCAAAGCTCACATATCT AAAGTCCCTCTTCTTAGTTGGGACAGAG 780 

781 TCAGTGCTAAGACACATGGGAACATGACCAGAAAAAAAAAATCATATTTAGCCCAGAGAC 840 

841 AACAATATTCTTGTACTGCAAGTCTCGTTATGGGCTAGCAAAGGAATCTACCCAACTTCT 900 

901 CAAATGTGTTGGGATGTCAAGTATATAGACTATTCATCAGTTCCAACTCTATCAAACTGT 960 

961 GCAGCTCAATTATAGAGTTGAATAAAGTGCTCCATCTATTTGTTCTTATCCTCATATTTG 1020 

1021 GTTAAGATATTMAATCACCTCCCACC AACATTTAAAGTGCACCATTTAAAGTGGCTCGC 1080 

1081 GAGCACCAAACCGCTGAAAACCGGAAATGTTTAGCACGTTGGCAGCGGGACCCTTTTCTA 1140 

1141 TCTCATCGTGTTCTTCGTTGTCCACC ACGGCCCACGGGCCAACGCTCCTCCATCCTGT AG 1200 

1201 TGTAGAGTATATTCC ATTTGCGACCG AGCCGAGCATCGATCCAGCCACACTGGCCACTGC 1260 

84 

1261 CAGCCAGCCATGTGGCACTCCTACGTATACTACGTGAGG TGAGATTCACTCACATQGGA T 1320 

-465 -405 

1321 GGGACCGAGATATTTTACTGCTGTGGTTGTGTGAGAGATAATAAAGCATTTATGACGATT 1380 
1381 GCTGAACAGCACACACCATGCGTCCAGAUAGAGAAAGCTTTCTCTCTTTATTCGC ATGC A 1440 

1441 TGTTTCATTATCTTTTATCATATATATATAACACATATTAAATGATTCTTCGTT CCAAT T 1500 
-285 "226 

1501 TATAATTCATTTGACTTTTTTATCCACCGATGCTCGTTTTATTAAAAAAAATATTATAAT 1560 
-225 -166 

1561 TATTGTTACTTTTTGTTGTAATATTGTTTAGCATATAATAAACTTTGATACTAGTATGTT 1 620 
-165 -106 

49 

1621 TCCGAGCAAAAAAAAATATTAATATTTAGATTACG AGCCCATTAATTAATTATATTCG AG 1680 
-105 "46 

83 +1 
1681 ACAAGCGAAGCAAAGCAAAGCAA GCTAATGTTGCCCCTGCTGTG CATGCAGAGGCCCGCT 1740 
-45 +15 

73******* 

1741 CTTGCTATAAACGAGGCAGCTAGACGCGACTCGACTCATCAGCCTCATCA ACCTCGACGA 1800 

+16 +75 
*************** y 

1801 AGGAGGAACGAACGGA CAGGTTGTTGCACAGAAGCGACATGGCTTTCGCGCCCAAAACGT 1860 
+76 MAFAPKTS +135 

38 
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1861 CCTCCTCCTCCTCGCTGTCCTCGGCGTTGCAGGCAGCTCAGTCGCCGCCGCTGCTCCTGA 1920 
+126 SSSSLSSALQAAQSPPLLLR +195 

40 + 41 

1921 GGCGGATGTCGTCGACCGCAACACCGAGACGGAGGTACGACGC GGCCGTCGTCGTCACTA 1980 
+196 RMS S TAT PRRRY DAAVVVT T +255 

1 981 CCACCACCACTGCTAGAGCTGCGGCGGCTGCTGTCACGGTTCCCGCCGCCCCGCCGCAGG 2040 
+256 T T TARAAAAAVT-VPAAP PQA +315 

75 $ 
2041 CGGGCCGCCGCCGCCGGTGCCACCAAAGCAAGCGGC GGCACCCGCAGAGGAGGAGCCG TC 2100 
+316 GRRRRCHQSKRRHPQRRSRP +375 

2101 CGGTGTCGGACACCATGGCGGCGCTCATGGCCAAGGGCAAGGTTCGTATAGTACGCGCGC 2160 
+376 VSDTMAALMAKGK 

2161 GTGTCGTCGTCGTTATTTTGCGCATAGGCGCGGACATACACGTGCTTTAGCTAGCTAACA 2220 
2221 GCTAG ATCATCGGTGC AGACGGCGTTCATCCCGT ACATCACCGCCGGCGACCCGGACCTA 2280 

TAFIPYITAGDPDL 

2281 GCGACGACGGCCGAGGCGCTGCGTCTGCTGGACGGCTGTGGCGCCGACGTCATCGAGCTG 2340 
ATTAEALRLLDGCGADVIEL 

2341 GGGGTACCCTGCTCGGACCCCTACATCGACGGGCCCATCATCCAGGCGTCGGTGGCGCGG 2400 
GVPCSDPYIDGPIIQASVAR 

2401 GCTCTGGCCAGCGGCACCACCATGGACGCCGTGCTGGAGATGCTGAGGGAGGTGACGCCG 2460 
ALASGTTMDAVLEMLREVTP 

2461 GAGCTGTCGTGCCCCGTGGTGCTCCTCTCCTACTACAAGCCCATCATGTCTCGCAGCTTG 2520 
ELSCPVVLLSYYKPIMSRSL 

2521 GCCGAGATGAAAGAGGCGGGGGTCCACGGTAACTATAGCTAGCTCTTCCGATCCCCCTTC 2580 
AEMKEAGVH 

2581 AATTAATTAATTTATAGTAGTCCATTCATGTGATGATTTTTGTTTTTCTTTTTACTGACA 2640 
2641 GGTCTTATAGTGCCTGATCTCCCGTACGTGGCCGCGCACTCGCTGTGGAGTGAAGCCAAG 2700 
GLIVPDLPYVAAHSLWSEAK 

2701 AACAACAACCTGGAGCTGGTAGGTTGAATTAAGTTGATGCATGTGATGATTTATGTAGCT 2760 
N N N L E L 

2761 AGATCGAGCTAGCTATAATTAGGAGCATATCAGGTGCTGCTGACAACACCAGCCATACCA 2820 

VLLTTPAIP 

2821 GAAGACAGGATGAAGGAGATCACCAAGGCTTCAGAAGGCTTCGTCTACCTGGTAGTTATA 2880 
EDRMKEITKASEGFVYL 

2881 TGTATATATAGATGGACGACGTAACTCATTCCAGCCCCATGCATATATGGAGGCTTCAAT 2940 
2941 TCTGCAGAGACGACGAAGACCACGACGACGACTAACACTAGCTAGGGGCGTACGTTGCAG 3000 

3001 GTGAGCGTGAACGGAGTGACAGGTCCTCGCGCAAACGTGAACCCACGAGTGGAGTCACTC 3060 
VSVNGVTGPRANVNPRVESL 



Fig. 24C 



3061 ATCCAGGAGGTTAAGAAGGTGACTMCMGCCCGTTGCTGTTGGCTTCGGCATATCCAAG 3120 
IQEVKKVTNKPVAVGFGI SK 

3121 CCCGAGCACGTGAAGCAGGTACGTACGTAGCTGACCAAAAAAAACTGTTAACAAGTTTTG 3180 
1? E 11 V K 

3181 TTTGACAAGCCGGCTACTAGCTAGCTAACAGTGATCAGTGACACACACACACACACAGAT 3240 

Q I 

3241 TGCGCAGTGGGGCGCTGACGGGGTGATCATCGGCAGCGCCATGGTGAGGCAGCTGGGCGA 3300 
AQWGADGVI IGSAMVRQLGE 

3301 AGCGGCTTCTCCCAAGCAAGGCCTGAGGAGGCTGGAGGAGTATGCCAGGGGCATGAAGAA 3360 
AAS PKQGLRRLEEYARGMKN 
+++ 

3361 CGCGCTGCCATGAGTCC ATGACAAAGTAAAACGT AC AGAGACACTTGATAATATCTATCT 3420 
ALP ^ Ann 
3 All ATCATCTCGGAGAAGACGACCGACCAATAAAAATAAGCCAAGTGGAAGTGAAGCTTAGCT 3480 

3481 GTATATACACCGTACGTCGTCGTCGTCGTTCCGGATCGATCTCGGCCGGCTAGCTAGCAG 3540 
3541 AACGTGTACGTAGTAGTATGTAATGCATGGAGTGTGGAGCTACTAGCTAGCTGGCCGTTC 3600 
3601 ATTCGATTATAATTCTTCGCTCTGCTGTGGTAGCAGATGTACCTAGTCGATCTTGTACGA 3660 

3661 CGAAGAAGCTGGCT AGCT AGCCGTCTCGATCGT AT ATGTACTGATTAATCTGC AGATTGA 3720 

$ 

3721 ATMAAACTACAGTACGCATATGATGCGTACGTACGTGTGTATAGTTTGTGCTCATATAT 3780 
3781 GCTCCTCATCACCTGCCTGATCTGCCCATCGATCTCTCTCGTACTCCTTCCTGTTAAATG 3840 
3841 CCTTCTTTGACAGACACACCACCACCAGCAGC AGTGACGCTCTGCACGCCGCCGCTTTAA 3900 
3901 GACATGTAAGATATTTTAAGAGGTATAAGATACCAAGGAGCACAAATCTGGAGCACTGGG 3960 
3961 ATATTGCAAAGACAAAAAAAAAACAAAATTAAAGTCCCACCAAAGTAGAGATAGTAAAGA 4020 
4021 GGTGGATGGATTAAAATTATCTCATGATTTTTGGATCTGCTCAAATAGATCGATATGGTA 4080 
4081 TTCAGATCTATGTTGTATAGCCTTTTCATTAGCTTTCTGAAAAAAAAATGGTATGATGAG 4140 
4141 TGCGGAGTAGCTAGGGCTGTGAAGGAGTCGGATGGGCTTCCACGTACTTGTTTGTGGCCC 4200 
4201 TAGTCCGGTTCT ATTTAGGTCCGATCCGAGTCCGGCATGGTCCGGTTCCATACGGGCTAG 4260 
4261 GACCAAGCTCGGCACGTGAGTTTTAGGCCCGTCGGCTAGCCCGAGCACGACCCGTTTTTA 4320 
4321 AACTGGCTAGGACTCGCCCATTTAATAAGACAAACATTGCAAAAAATAGCTCTATTTTTT 4380 
4381 ATTTAAAATATATTGTTTATTTGTGAAATGTGTATTATTTGTMTATATATTATTGTATA 4440 
4441 TAGTTATATCTTCAATTATGATTTATAAATATGTTTTTTATTATGAACTCAATTTTAAGT 4500 
4501 TTGATTTATGCGTTGGCGGGCTCGAGGAGGCACGGTGAACATTTTTGGGTCGGGCTTAAC 4560 
4561 GGGTCGGCCCGGCCCGGTTCGGCCCATCCACGGCCCATCCCGTGTCGGCCTCGTTCGGTG 4620 
4621 AGTTCAGCCCGTCGGACAACCCGTCCCCGGCCCGGATAATTAATCGGGCCTAACCGTGGC 4 680 
4681 GTGCTTAAACGGTCCGTGCCTCAACGGACCGGGCCGCGGGCGGCCCGTTTGACATCTCTA 4740 
4741 GTGGTGTGATTAGAGATGGCGATGGGAACCGATCACTGATTCCGTGTGGAGAATTCGATA 4800 
4801 TCAAGCTTATCGATACC 4817 

Entire sequence of the maize TrpA gene, with introns 

and exons, transcription and translation strats, start and 

stop of cDNA. . , 

$ = start and end of cDNA; +1 = transcription start; 73******* = 
primer extension primer; t = start of translation; +++ = stop 

codon; = CCAAT Box, TATAA Box, poly A addition site. 

# above underlined sequences are PCR primers. 




21 1D 5N984 



39d 46d 
S L R P SH LP UP' LP MP UP' 




Northern blot showing differential expression of 
TrpA gene in maize tissues. 2 hour exposure against 
film at -80C with Dupont Cronex intensifying screens. 

Fig. 25A 
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39d 46d 
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S L R P SH LP UP LP MP UP 




Northern blot showing differential expression of 
TrpA gene in maize tissues. 4 hour exposure against 
film at -80C with Dupont Cronex intensifying screens. 

Fig. 25B 




211 D 5N984 



39d 46d 
S L R P SH LP UP' LP MP UP 




Northern blot showing differential expression of 

TrpA gene in maize tissues. 1 8 hour exposure against 

film at -80C with Dupont Cronex intensifying screens. 

Fig. 25C 
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21 1D 5N984 

" 39d 46d ' 

S L R P SH LP UP' LP MP UP 




Northern blot showing differential expression of 

TrpA gene in maize tissues. 48 hour exposure against 

film at -80C with Dupont Cronex intensifying screens. 

Fig. 25D 



Northern blot showing maize TrpA gene expression in 
Funk lines 21 1D and 5N984 leaf and pith and the absense 
of expression in 21 1D seed total RNA. 
65 hour exposure against film at -80C with Dupont Cronex 
intensifying screens. 

Fig. 26 



10X 5X 1X S H EV E B 




Genomic southern of Funk line 211 D probed with the 
TrpA cDNA 8-2. B = BamHI, E = EcoRI, EV = EcoRV, 
H = Hindlll and S = Sacl. 

120 hour exposure against film at -80C with Dupont Cronex 
intensifying screens. 



Fig. 27 




Primer extension showing the transcription start of TrpA 
gene and sequencing ladder. 

1 hour exposure against film at -80C with Dupont Cronex 
intensifying screens. 

Fig. 28A 



42°C 48°C 



54°C 




123 



RNase protection of region from +2 bp to +387 bp 
with three annealing temperatures. 
16 hour exposure against film at -80C with Dupont Cronex 
intensifying screens. 



Fig. 28B 



Fig. 3 OB 



969 CTG GAT AGA GAA GAG CAC CTT TAC ACA GCA TTC CAG TAT TTC GAC MG GAC MC AGC 
323>Leu Asp Arg Glu Glu His Leu Tyr Thr Ala Phe Gin Tyr Phe Asp Lys Asp Asn Ser 

1026 GGG TAC ATT ACT AM GAA GAG CTT GAG CAC GCC TTG AAG GAG CAA GGG TTG TAT GAC 
342>Gly Tyr He Thr Lys Glu Glu Leu Glu His Ala Leu Lys Glu Gin Gly Leu Tyr Asp 

1083 GCC GAT AAA ATC AAA GAC ATC ATC TCC GAT GCC GAC TCT GAC AAT GAT GGA AGG ATA 
361^Ala Asp Lys He Lys Asp He lie Ser Asp Ala Asp Ser Asp Asn Asp Gly Arg He 

1140 GAT TAT TCA GAG TTT GTG GCG ATG ATG AGG AAA GGG ACG GCT GGT GCC GAG CCA ATG 
380^Asp Tyr Ser Glu Phe Val Ala Met Met Arg Lys Gly Thr Ala Gly Ala Glu Pro Met 

1197 AAC ATC AAG AAG AGG CGA GAC ATA GTC CTA TAG TGAAGTGAAGCAGCAAGTGTGTAATGTAATGTG 
399Wsn lie Lys Lys Arg Arg Asp He Val* Leu ... 

1263 TATAGCAGCTCAAACAAGCAAATTTGTACATCTGTACACAAATGCAATGGGGTTACTTTTGCAAAAAAAAAAAAAAA 




Fig. 30A 

Maize Pollen CDPK cDNA sequence 
sequence contained in clones pCIB3168 and pCIB3169 

1 TG CAG ATC ATG CAC CAC CTC TCC GGC CAG CCC AAC GTG GTG GGC CTC CGC GGC GCG 
]>Gln He Met His His Leu Ser Gly Gin Pro Asn Val Val Gly Leu Arg Gly Ala 



57 TAC GAG GAC AAG CAG AGC GTG CAC CTC 
19>Tyr Glu Asp Lys Gin Ser Val His Leu 

Aval 

114 TTC GAC CGC ATC ATC GCC CGG GGC CAG 
38^Phe Asp Arg He He Ala Arg Gly Gin 

171 CGC GCC ATC GTG CAG ATC GTG CAC ACC 
57>Arg Ala He Val Gin He Val His Thr 

Aval 

228 ATC AAG CCC GAG AAC TTC 
76Hle Lys Pro Glu Asn Phe 

285 ACC GAC TTC GGC CTC TCC GTC TTC TTC 
95^Thr Asp Phe Gly Leu Ser Val Phe Phe 

Aval 

342 GGC AGC GCC TAC TAC ATC GCG CCC GAG 
114>Gly Ser Ala Tyr Tyr He Ala Pro Glu 

399 GAC ATC TGG AGC GTC GGC GTC ATG CTC 
133^Asp He Trp Ser Val Gly Val Met Leu 

456 TGG GCA GAG AAC GAG AAC GGC ATC TTC 
152^Trp Ala Glu Asn Glu Asn Gly He Phe 

513 TCC AGC GAG CCA TGG CCA CAC ATC TCG 
171>Ser Ser Glu Pro Trp Pro His He Ser 



GTC ATG GAG CTG TGC GCG GGC GGG GAG CTC 
Val Met -Glu Leu Cys Ala Gly Gly Glu Leu 



TAC ACG GAG CGC GGC GCC GCG GAG CTG CTG 
Tyr Thr Glu Arg Gly Ala Ala Glu Leu Leu 

TGC CAC TCC ATG GGG GTG ATG CAC CGG GAC 
Cys His Ser Met Gly Val Met His Arg Asp 



CCG CTC AAG GCC 
Pro Leu Lys Ala 

AAG GAG GGC GAG CTG CTC AGG GAC ATC GTC 
Lys Glu Gly Glu Leu Leu Arg Asp He Val 



GTG CTC AAG AGG AAG TAC GGC CCG GAG GCC 

Val Leu Lys Arg Lys Tyr Gly Pro Glu Ala 

TAC ATC TTC CTC GCC GGC GTG CCT CCC TTC 

Tyr He Phe Leu Ala Gly Val Pro Pro Phe 

ACC GCC ATC CTG CGA GGG CAG CTT GAC CTC 

Thr Ala He Leu Arg Gly Gin Leu Asp Leu 

CCG GGA GCC AAG GAT CTC GTC AAG AAG ATG 

Pro Gly Ala Lys Asp Leu Val Lys Lys Met 



CTG CTG CTC AGC AAG GAC GAG GAC GCG 
Leu Leu Leu Ser Lys Asp Glu Asp Ala 



570 CTC AAC ATC AAC CCC AAG GAG CGG CTC ACG GCG TTC CAG GTC CTC AAT CAC CCA TGG 
190^Leu Asn He Asn Pro Lys Glu Arg Leu Thr Ala Phe Gin Val Leu Asn His Pro Trp 

627 ATC AAA GAA GAC GGA GAC GCG CCT GAC ACG CCG CTT GAC AAC GTT GTT CTC GAC AGG 
209Hle Lys Glu Asp Gly Asp Ala Pro Asp Thr Pro Leu Asp Asn Val Val Leu Asp Arg 

684 CTC AAG CAG TTC AGG GCC ATG AAC CAG TTC AAG AAA GCA GCA TTG AGG ATC ATA GCT 
228^Leu Lys Gin Phe Arg Ala Met Asn Gin Phe Lys Lys Ala Ala Leu Arg He He Ala 

741 GGG TGC CTA TCC GAA GAG GAG ATC ACA GGG CTG AAG GAG ATG TTC AAG AAC ATT GAC 
247>Gly Cys Leu Ser Glu Glu Glu He Thr Gly Leu Lys Glu Met Phe Lys Asn He Asp 

798 AAG GAT AAC AGC GGG ACC ATT ACC CTC GAC GAG CTC AAA CAC GGG TTG GCA AAG CAC 
266>Lys Asp Asn Ser Gly Thr He Thr Leu Asp Glu Leu Lys His Gly Leu Ala Lys His 

855 GGG CCC AAG CTG TCA GAC AGC GAA ATG GAG AAA CTA ATG GAA GCA GCT GAC GCT GAC 
285>Gly Pro Lys Leu Ser Asp Ser Glu Met Glu Lys Leu Met Glu Ala Ala Asp Ala Asp 

EcoRI 

912 GGC AAC GGG TTA ATT GAC TAC GAC GAA TTC GTC ACC GCA ACA GTG CAT ATG AAC AAA 
304K51y Asn Gly Leu He Asp Tyr Asp Glu Phe Val Thr Ala Thr Val His Met Asn Lys 



Fig. 29 

EcoRI 
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Fig. 31 
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Fig. 37C 



2 641 AGAGCGGAGA 2VGWkTGGAG AGACAAACGT 
ArgAlaGlu LysLysTrp ArgAspLysArg 

2701 TATAAAGAGG CAAAAGAATC TGTAGATGCT 
TyrLysGlu AlaLysGlu SerValAspAla 

2761 CAAGCGGATA CCAACATCGC GATGATTCAT 
GlnAlaAsp ThrAsnlle AlaMetlleHis 

2821 GAAGCTTATC TGCCTGAGCT GTCTGTGATT 
GluAlaTyr LeuProGlu LeuSerVallle 

2881 TTAGAAGGGC GTATTTTCAC TGCATTCTCC 
LeuGluGly ArgllePhe ThrAlaPheSer 

2941 GGTGATTTTA ATAATGGCTT ATCCTGCTGG 
GlyAspPhe AsnAsnGly LeuSerCysTrp 

3001 CAAAACAACC ACCGTTCGGT CCTTGTTGTT 
GlnAsnAsn HisArgSer ValLeuValVal 

3061 GTTCGTGTCT GTCCGGGTCG TGGCTATATC 
ValArgVal CysProGly ArgGlyTyrlle 

3121 GGAGAAGGTT GCGTAACCAT TCATGAGATC 
GlyGluGly CysValThr IleHisGluIle 

3181 AACTGTGTAG AAGAGGAAGT ATATCCAAAC 
AsnCysVal GluGluGlu ValTyrProAsn 

3241 ACTCAAGAAG AATATGAGGG TACGTACACT 
ThrGlnGlu GluTyrGlu GlyThrTyrThr 

3301 GAAAGCAATT CTTCTGTACC AGCTGATTAT 
GluSerAsn SerSerVal ProAlaAspTyr 

3361 GATGGACGAA GAGACAATCC TTGTGAATCT 
AspGlyArg ArgAspAsn ProCysGluSer 

3421 CCAGCTGGCT ATGTGACAAA AGAATTAGAG 
ProAlaGly TyrValThr LysGluLeuGlu 

3481 GAGATCGGAG AAACGGAAGG AACATTCATC 
GluIleGly GluThrGlu GlyThrPhelle 

3541 GAATAA 
Glu 



GAAAAATTGG AATGGGAAAC AAATATTGTT 
GluLysLeu GluTrpGlu ThrAsnlleVal 

TTATTTGTAA ACTCTCAATA TGATAGATTA 
LeuPheVal AsnSerGln TyrAspArgLeu 

GCGGCAGATA AACGCGTTCA TAGCATTCGA 
AlaAlaAsp LysArgVal HisSerlleArg 

CCGGGTGTCA ATGCGGCTAT TTTTGAAGAA 
ProGlyVal AsnAlaAla IlePheGluGlu 

CTATATGATG CGAGAAATGT CATTAAAAAT 
LeuTyrAsp AlaArgAsn VallleLysAsn 

AACGTGAAAG GGCATGTAGA TGTAGAAGAA 
AsnValLys GlyHisVal AspValGluGlu 

CCGGAATGGG AAGCAGAAGT GTCACAAGAA 
ProGluTrp GluAlaGlu ValSerGlnGlu 

CTTCGTGTCA CAGCGTACAA GGAGGGATAT 
LeuArgVal ThrAlaTyr LysGluGlyTyr 

GAGAACAATA CAGACGAACT GAAGTTTAGC 
GluAsnAsn ThrAspGlu LeuLysPheSer 

AACACGGTAA CGTGTAATGA TTATACTGCG 
AsnThrVal ThrCysAsn AspTyrThrAla 

TCTCGTAATC GAGGATATGA CGGAGCCTAT 
SerArgAsn ArgGlyTyr AspGlyAlaTyr 

GCATCAGCCT ATGAAGAAAA AGCATATACA 
AlaSerAla TyrGluGlu LysAlaTyrThr 

AACAGAGGAT ATGGGGATTA CACACCACTA 
AsnArgGly TyrGlyAsp TyrThrProLeu 

TACTTCCCAG AAACCGATAA GGTATGGATT 
TyrPhePro GluThrAsp LysValTrpIle 

GTGGACAGCG TGGAATTACT TCTTATGGAG 
ValAspSer ValGluLeu LeuLeuMetGlu 



Fig. 37B 

1261 CCACCTCGAC AGGGCTTCAG CCACCGTCTG AGCCACGTGA CCATGTTCCG CAGTGGCTTC 
ProProArg GlnGlyPhe SerHisArgLeu SerHisVal SerMetPhe ArgSerGlyPhe 

1321 AGCAACAGCA GCGTGAGCAT CATCCGTGCA CCTATGTTCA GCTGGATTCA CCGCAGTGCC 
SerAsnSer SerValSer IlelleArgAla ProMetPhe SerTrpIle HisArgSerAla 

1381 GAGTTCAACA ACATCATCCC CAGCAGCCAG ATCACCCAGA TCCCCCTGAC CAAGAGCACC 
GluPheAsn Asnllelle ProSerSerGln IleThrGln IleProLeu ThrLysSerThr 

1441 AACCTGGGCA GCGGCACCAG CGTGGTGAAG GGCCCCGGCT TCACCGGCGG CGACATCCTG 
AsnLeuGly SerGlyThr SerValValLys GlyProGly PheThrGly GlyAspIleLeu 

i Rni rrrrrrACCA GfCCCGGCCA GATCAGCACC CTGCGCGTGA ACATCACCGC CCCCCTGAGC 
A?gA?gTh5 Se?p5oGly GlnlleSerThr LeuArgVal AsnlleThr AlaProLeuSer 

l<=ifi1 rAGCGCTACC GCGTCCGCAT CCGCTACGCC AGCACCACCA ACCTGCAGTT CCACACCAGC 
GlnA?gTyr ArgValArg IleArgTyrAla SerThrThr AsnLeuGln PheHxsThrSer 

uo1 Arnrrnrrrrr rrrrr ATCAA CCAGGGCAAC TTCAGCGCCA CCATGAGCAG CGGCAGCAAC 
1621 niSpGl^gpSSl^HGlnGlSAS PheSerAla ThrMetSer SerGlySerAsn 

1681 TTGCAGAGCG GCAGCTTCCG CACCGTGGGC TTCACCACCC CCTTCAACTT CAGCAACGGC 
SSclnler GlySerPhe ArgThrValGly PheThrThr ProPheAsn PheSerAsnGly 

1741 ArCAGCGTGT TCACCCTGAG CGCCCACGTG TTCAACAGCG GCAACGAGGT GTACATCGAC 
Se?sS?Val PheThrLeu SerAlaHisVal PheAsnSer GlyAsnGlu ValTyrlleAsp 

iftm rrc ATCGAGT TCGTGCCCGC CGAGGTGACC TTCGAGGCCG AGTACGACCT GGAGAGGGCT 
1801 CGCATCGAGT p TCtlbCL^ iaGiuVaiThr pheGluAla GluTyrAs p LeuGluArgAla 

iqsi raraarrrrG TGAACGAGCT GTTCACCAGC AGCAACCAGA TCGGCCTGAA GACCGACGTG 
18 GlnL?sAla vIlttnGlu teuPheThrSer SerAsnGln IleGlyLeu LysThrAspVal 

1921 JgS^^ISS^iS^iSSK s^^ss^t^^ 

2101 sgg^i^ffi^isaiiig gg^vsss^ssgssi 

2161 ^^^^^^^ 

2221 ss^L^^^SbSS^ ssssss? T^isssMSsajssg 

228 1 gAG^TCTA^TTC^J^TGCC ^^^^.Sg^ 

23 41 G fyi CCTTAT^ 

9 4m TrrrCTCCGC ACCTGGAGTG GAACCCGGAC CTAGACTGCA GCTGCAGGGA CGGGGAGAAG 
2 CysAlaPro HisLeuGlu TrpAsnProAsp LeuAspCys SerCysArg AspGlyGluLys 

2521 iss G G s^ G »isa?f {nssgfGSJspg^JsaJSsa 



Fig. 37A 



1 ATGGACAACA ACCCCAACAT CAACGAGTGC ATCCCCTACA ACTGCCTGAG CAACCCCGAG 
NetAspAsn AsnProAsn IleAsnGluCys IleProTyr AsnCysLeu SerAsnProGlu 

61 GTGGAGGTGC TGGGCGGCGA GCGCATCGAG ACCGGCTACA CCCCCATCGA CATCAGCCTG 
ValGluVal LeuGlyGly GluArglleGlu ThrGlyTyr ThrProIle AspIleSerLeu 

121 AGCCTGACCC AGTTCCTGCT GAGCGAGTTC GTGCCCGGCG CCGGCTTCGT GCTGGGCCTG 
SerLeuThr GlnPheLeu LeuSerGluPhe ValProGly AlaGlyPhe ValLeuGlyLeu 

181 GTGGACATCA TCTGGGGCAT CTTCGGCCCC AGCCAGTGGG ACGCCTTCCT GGTGCAGATC 
ValAspIle IleTrpGly IlePheGlyPro SerGlnTrp AspAlaPhe LeuValGlnlle 

241 GAGCAGCTGA TCAACCAGCG CATCGAGGAG TTCGCCCGCA ACCAGGCCAT CAGCCGCCTG 
GluGlnLeu IleAsnGln ArglleGluGlu PheAlaArg AsnGlnAla IleSerArgLeu 

-?ni CArrrrrTGA GrAACCTGTA CCAAATCTAC GCCGAGAGCT TCCGCGAGTG GGAGGCCGAC 
301 giSgiSlIS SerA^nLeu TyrGlSlleTyr AlaGluSer PheArgGlu TrpGluAlaAsp 

361 CCCACCAACC CCGCCCTGCG CGAGGAGATG CGCATCCAGT TCAACGACAT GAACAGCGCC 
ProThrAsn ProAlaLeu ArgGluGluMet ArglleGln PheAsnAsp MetAsnSerAla 

421 CTGACCACCG CCATCCCCCT GTTCGCCGTG CAGAACTACC AGGTGCCCCT GCTGAGCGTG 
LeuThrThr AlallePro LeuPheAlaVal GlnAsnTyr GlnValPro LeuLeuSerVal 

481 TACGTGCAGG CCGCCAACCT GCACCTGAGC GTGCTGCGCG ACGTCAGCGT GTTCGGCCAG 
TyrValGln AlaAlaAsn LeuHisLeuSer ValLeuArg AspValSer ValPheGlyGln 

541 CGCTGGGGCT TCGACGCCGC CACCATCAAC AGCCGCTACA ACGACCTGAC CCGCCTGATC 
ArgTrpGly PheAspAla AlaThrlleAsn SerArgTyr AsnAspLeu ThrArgLeuIle 

fim rrc AACTACA CCGACCACGC CGTGCGCTGG TACAACACCG GCCTGGAGCG CGTGTGGGGT 
601 il^AfS?? ThrSpSs klkvalArgTrp TyrAsnThr GlyLeuGlu ArgValTrpGly 

661 TCCGACAGCC GCGACTGGAT CAGGTACAAC CAGTTCCGCC GCGAGCTGAC CCTGACCGTG 
P^oAspf er ArglspSp IleArgTyrAsn GlnPheArg ArgGluLeu ThrLeuThrVal 

no-\ rrrrACArCG TGAGCCTGTT CCCCAACTAC GACAGCCGCA CCTACCCCAT CCGCACCGTG 
721 Sspne vIlterLeS PheProAsnTyr AspSerArg ThrTyrPro IleArgThrVal 

7R1 arfVAGCTGA CCCGCGAGAT TTACACCAAC CCCGTGCTGG AGAACTTCGA CGGCAGCTTC 
781 sSSSefl*rA?5GirileTyrThrAsn ProValLeu GluAsnPhe AspGlySerPhe 

341 rCCGGCAGCG CCCAGGGCAT CGAGGGCAGC ATCCGCAGCC CCCACCTGAT GGACATCCTG 
Sgllyler AlacYnGly IleGluGlySer IleArgSer ProHisLeu MetAspIleLeu 

901 AACAGCATCA CCATCTACAC CGACGCCCAC CGCGGCGAGT ACTACTGGAG CGGCCACCAG 
AsnSerlle ThrlleTyr ThrAspAlaHis ArgGlyGlu TyrTyrTrp SerGlyHxsGln 

aTraTrrrrA GCCCCGTCGG CTTCAGCGGC CCCGAGTTCA CCTTCCCCCT GTACGGCACC 
961 flSe?Ala SertSSvS GlyPhiserGly ProGluPhe ThrPhePro LeuTyrGlyThr 

1091 ATGGGCAACG CTGCACCTCA GCAGCGCATC GTGGCACAGC TGGGCCAGGG AGTGTACCGC 
Metilytsn AlXlaPro GlnGlnArglle ValAlaGln LeuGlyGln GlyValTyrArg 

10R1 ArrrTGAGCA GCACCCTGTA CCGTCGACCT TTCAACATCG GCATCAACAA CCAGCAGCTG 
1081 ThSSSsS siSSfeu ?yrArgArgPro PheAsnlle GlylleAsn AsnGlnGlnLeu 

1141 AGCGTGCTGG ACGGCACCGA GTTCGCCTAC GGCACCAGCA GCAACCTGCC CAGCGCCGTG 
SerValLeu AspGlyThr GluPheAlaTyr GlyThrSer SerAsnLeu ProSerAlaVal 

1201 TACCGCAAGA GCGGCACCGT GGACAGCCTG GACGAGATCC CCCCTCAGAA CAACAACGTG 
TyrArgLys SerGlyThr ValAspSerLeu AspGlulle ProProGln AsnAsnAsnVal 



Fig. 35E 



pol CDPK gene Map (1 > 4165) Site and Sequence 



TCAGCCGACAAACTAAACTATAGAAACCACATCATGATATCAAATTTTGAGGTGGCGGTGCTACAGAAATAGAACCCAGT 
> » »- -J — « 1 ^ 1 1 * 1 » . 1 1 l 36OO 



•INTRON6- 



ACACCAAAATGACTAACTTGTCATGATTAGTTGTTCCTCGTAACTGAACATTTGTGTTCTTAGTTTCTTATTGTTAAACC 
' * > 1 ■ 1 ■ 1 «— — ■ — ' — *■ 3680 



•1NTRON6- 



AAAGACTTAAATTCACTTTTGCACATGCAGGATGGAAGGATAGATTATTCAGAGTTTGTGGCGATGATGAGGAAAGGGAC 
, — — l- — i i . ■ . . « i . 1 i i i 3760 



■INTRON 6- 



-EXON7- 



GGCTGGTGCCGAGCCAATGAACATCAAGAAGAGGCGAGACATAGTCCTATAGTGAAGTGAAGCAGWAAGTGTGTAATGTA 

. ^ 1 i < ' » • 1 • «- 3840 



■EXON7- 



ATGTGTATAGCAGCTCAAACAAGCAAATTTGTACATCTGTACACAAATGCAATGGGGTTACTTTTGCAACTTAGTTCATG 
i — 1 uj i j .... i i « i ■ — — - l 3920 

GATGGTTGTGTACGTTGTGCTATTGATTGCAAGTGATTTGAAAGACATGCATACTTAGGAACTGAGAAAGATAGATCTAC 
i , . i 1 1 — ■ i * " 1 ' — * — 4000 

TACTGCTAGAGACAGAACAATAGGATKKYAATTCAGYAAGTGYGTATTTCAGAAGACTACAGCTGGCATCTATTATTCTC 

■ i i i t i i ■ ■ 1 . . . I . -i- I ■ . .... i , r , ■ . . I 1 »■ 4080 

ATTGTCCTCGCAAAAATACTGATGATGCATTTGAGAGAACAATATGCAACAAGATCGAGCTCCCTATAGTGAGTCGTATT 
, _i * ■ i ■ ■ ■ ' 1 ■ L 4160 

AGGCC 

— <■+ 4165 
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pjICDPKqene Map (1 >4165) Site and Sequence 



GGGTACATTATCTGATAAAAGCTCCACAAATACAACTTCTGAAGAACAGCAATGCTTACAC6GCAGAATTTTCATTATAA 

• ' 1 — ■ ' 1 —J ' ■ ■ ■ ' 2880 



=> 



■INTRON3- 



ATGCTCTTGATGACATAATGTTAGATCATAGCTGGGTGCCTATCCGAAGAGGAGATCACAGGGCTGAAGGAGATGTTCAA 

• ... ...... . . . i . ' — ' '■ 2960 



1 




1NTRON 3 1 


EXON 4 



GAACATTGACAAGGATAACAGCGGGACCATTACCCTCGACGAGCTCAAACACGGGTTGGCAAAGCACGGGCCCAAGCTGT 

.... ■ - i - ■ 1 ■ • ' i ■ . . . i 3010 



•EXON 4- 



CAGACAGCGAAATGGAGAAACTAATGGAAGCAGTGAGTTTTCAGAGTACAATCTTAAAAAAAGGAATTGTGATTCTTTTC 

. . i - i ■ 1 ...... i . '**"** 1 ■ . ■ i ' ■ ■ ' 1 3i^u 

» 



_ /■ 

■ EXON 4 1 — INTRON 4- 



AAAATGAAGAAGTAATCTGAAAACATCCCTGCTGAAATGCTTTATACATTTCCAGGCTGACGCTGACGGCAACGGGTTAA 

........ i . ... I ' • i - - ... . ■ I . ■ i i ■ i i I i i i i l 



3200 



-I 



•INTRON 4- 



• EXON 5- 



,EcoRI 

TTGACTACGACGAATTCGTCACCGCAACAGTGCATATGAACAAACTGGATAGAGAAGAG CACCTTTACACAGCATTCCAG 

....... i ...... . i ' - - . ' ... .1 i I i ■ ■ i i o^oU 



■EXON 5- 



: EcoR I 

TATTTCGACAAGGACAACAGCGGGTAAGTTGAACGTTAAAATGATACAGCTGGTACCTGAATTCTGGACAACACATATCA 



3360 



> 



EXON 5 -* INTRON 5- 

TAACAGGACACATATATAATTCGTTTATCTCACAGGTACATTACTAAAGAAGAGCTTGAGCACGCCTTGAAGGAGCAAGG ^ 







INTRON 5 1 


-EXON 6 



GTTGTATGACGCCGATAAAATCAAAGACATCATCTCCGATGCCGACTCTGACAATGTAAGGAAC AAACATTATTTAAATT 

i ... i .. .. t . .... i . . . ... I i i ■ . I ■ i 00«£(J 

^ 



Fig. 35C 

pol CDPK gene Map (1 > 4165) Site and Sequence 



Ava I 

I 

CGTGCAGATCGTGCACACCTGCCACTCCATGGGGGTGATGCACCGGGACATCAAGCCCGAGAACTTCCTGCTGCTCAGCA 



2160 



•EXON1- 



AGGACGAGGACGCGCCGCTCAAGGCCACCGACTTCGGCCTCTCCGTCTTCTTCAAGGAGGGCGAGCTGCTCAGGGACATC 

■ ■ ■ - ■ ■ » ' ■ ■ 1 1 ■ ■ 1 — 1 ■ ' 2210 



■EXON1- 



Ava I 

GTCGGCAGCGCCTACTACATCGCGCCCGAGGTGCTCAAGAGGAAGTACGGCCCGGAGGCCGACATCTG GAGCQTCGGCGT 

. .. . i ....... i . i i. i ' 2320 



•EX0N1- 



Bam HI 

i 

CATGCTCTACATCTTCCTCGCCGGCGTGCCTCCCTTCTGGGCAGGTCGGATCCGTCCGTGTTCGTCCTAGACGATATACA ^ 



> 



. EXON 1 ' ' »NTRON 1 

GAACCCGACGATGGATTTGCTTCTCAGCCCTGTTCTTGCATCACCAGAGAACGAGAACGGCATCTTCACCGCCATCCTGC ^ 



•INTRON 1- 



■EXON 2- 



CARRGCAGCTTGACCTCTCCAGCGAGCCATGGCCACACATCTCGCCGGGAGCCAAGGA TCTCGTCAAGAAGATGCTCAAC 

. . - ■ ■ [ ■ ' 1 ..I. i i ■ . . ' ^OOU 



-EXON 2- 



ATCAACCCCAAGGAGCGGCTCACGGCGTTCCAGGTCCTCAGTAAGTACCCAGATCGTTGCTGTCATACACTCATATGAAT ^ 



EXON 2- 1 ■ INTRON 2 ; 

TGTATCGTTCATGAGCAACGATCGAGCGGATTTGGTGAACTTGTAGATCACCCATGGAT CAAAGAAGACGGAGACGCGCC 

.. I I - - , i ■ - - - - ' - - - i .... I ■ I ■ ■■« I " * * * ' c.tc\J 



■ INTRON 2- 



■ EXON 3- 



TGACACGCCGCTTGACAACGTTGTTCTCGACAGGCTCAAGCAGTTCAGGGCCATGAACCAGTTCAAGAAAGCAGCATTGA 

....... I - ~ ' - - - 1 - - - * ■ ... I .... i . i ■ ■ 1 m *,,! — . — „i I-—*.. ...-i- 2oUU 



• EXON 3- 



Fig. 35B 

pol CDPK gene Map (1 > 4165) Site and Sequence 



Xbal 

i 

CAAAATCTAGAAACCTTTTTTTTTCCTCCCGATACGCCCCTCCATCTCTCGCCGTTCATGTCCGTGGCTGGCTGCCCTCC 

■ ■ ■ ■ ■ ■ 1440 

l mRNA start J 

GTGGGAGCAGGCGGCCGCACTCGTTCCCCGCCGCAGCCATGGGCCAGTGCTGCTCCAAGGGCGCCGGAGAGGCCCCGCCA 

■ ' - - . .... i ... i ''*'* ' i i i i i . i .... i i 1520 



EXON 1 

CCGAGGCGCCAAACGGCAGGCGCCAAGCCGCGGGCGTCCGCGAACAACGCCGACGGACAACGGGCGTCGTCCTC GTCCGC 

1 ... t - - - . . . t - ~ - - ■ - - - - i - - - * .... 1 .... t .... I I I 



1600 



■EXON1- 



GGTGGCTGCTGCCGCTGCTGCTGCCGGTGGTGGTGGCGGCGGCACGACGAAGCCGGCCTCACCCACCGGCGGCGCCAGGG 

......... i - « ... ...... i ..... i i .. i . ... i .... i .... i i ]fc)oU 



■EXON1- 



CCAGCTCCGGCAGCAAACCGGCGGCGGCCGTGGGCACGGTGCTGGGCCGGCCCATGGAGGACGTGCGCGCGACCTACTCG ^ 



•EXON1- 



Ava I 

! 



ATGGGCAAGGAGCTCGGGCGCGGGCAGTTCGGCGTGACGCACCTGTGCACGCACCGGACGAGCGGCGAGAAGCTGGCGTG 

...... t ........ i - - - . t ..... . ^-t-. ■ . . i ■ . . ■ ■ ■ ■ I « ■ i- I OnU 



■EXON1- 



CAAGACGATCGCGAAGCGGAAGCTGGCGGCCAGGGAGGACGTGGACGACGTGCGGCGGGAGGTGCAGATCATGCACCACC 



•EXON1- 



TCTCCGGCCAGCCCAACGTGGTGGGCCTCCGCGGCGCGTACGAGGACAAGCAGAGCGTGCACCTCGTCATGGAGCTGTGC 



EXON1 ~ 

Ava I 

TCGACCGCATCATCGCCCGGGGCCAGTACACGGAGCGCGGCGCCGCGGAGCTGCTGCGCGCCAT 



Fig. 35A 

pol CDPK gene Map (1 > 4165) Site and Sequence 

Enzymes : 6 of 1 98 enzymes (Filtered) 
Settings : Circular, Certain Sites Only. Standard Genetic Code 



TTAGTAACACCTCTCCAATCGCTTGGGTTGGCACATTCTTAGCTTTTATCACATTTTAAGAAATAGAGTTCACCACCTTC 
>•-'■•■-'-••■'••■ — i .... t .... i 1 * ' ' 



80 



AAAATATGCCTATACAATGAATGATGCTTGGATGCAATATAGCTAGATTCAACTAGCTATATATGGTCAATAGAACCCTG 

...-.-« . 1 ■ * ■ -J ■ ■ ' 1 160 

TGAGCACCTCACAAACACGACTTCAATTTTGAGACCCTAAGCGAGTAAATGGTTAAAGTC CTCTTATTATTAGTCTTAGG 

ACTTCTCCTTGCTAAATGCTTGTCAGCGATCTATATATCTTCCCCACTGCGGGAGATACTATATATAGGGCCTTGGACCT 

. . . ■ . . . i ' ■ ■ ■ i i i ' i i t .... i .... i 320 

CTAGGGTATCTCAAAGGCCTAGTCACAACAATTCTCAACAGTATTTAATTTTATACATGTATG AACAGTGTAGGAATTTG 

........ i . - . . f ... i - . i-*- ■ ■ ■ ■ ■ « * »■ " » * ■ * ' * »■»■'* * ■*-*■' 4UU 

AGTGCCCAACCCAAGAGTGGGAGGTGTAAATTGGGTAGCTAAACTTAAATAGGGCTCTTCTTAT TTAGGTTTATCTAGTC 

» . i . . - . .... i .... i. i ... ... - . i i * 1 qoU 

TCT ACTTAGACTAATTCAGAAAGAATTTTACAACCTATGGTTAATCATATCTCTAGTCTAAGCAAATTTAGGAAAGTTAA ^ 
AA GCACACAATTAGGCACATGTGAAAGATGTGTATGGTAAGTAAAAGACTTATAAG6AAAAAGT6GGTGAATCCTCAAGA ^ 
TGTGGTGGTATATCCCAATGATATTAGATGCCAGAATATAGGGGGGAAATCGATGTATACCATCTCTACCAGGATACCTG ^ 
TGCGGACTGTGCAACTGACACATGGACCATGGTGTCTTCTTAGATTTGGTTATTAGCTAATTGCGCTACAACTTGTTCAA ^ 
GGCTAGACCAAATTAAAAAACTAATATTAAACATAAAAAGTTAGGCAAACTATAGTAAATTATGCAGCGATCCAACAACA ^ 
AGCCATGTCTCGTGGGTCATGAGCCACGCGTCGGCCATACACCCACATGATGTTTCCATACGGATGGTCCTTATGCAATT ^ 
TTGTCTGCAAAACACAAGCCTTAATACAGCCACGCGACAATCATGGAAGTGGTCGTTTTAGGTCCTCATCATGAAGTTCA ^ 
GGGAAAACGCATCAAATGTAATGCAGAGAAATGGTATTTCTTCTCTTGTAAATCA6GGAGAGGAGTACCATCAGTACAGA ^ 

.EcoRI 

TTCAGAATCAGAATTCAGTCTTCCAACGACAATAATCGCAGCATCTTGTAAAAATTTGCAGAAACTTCTGTTTGACTTGT ^ 

AGCCCTG ACCTTTGCAAATATTTGAAGTTGTGCCTGCTGACACAACTTCAATCTGGAAGTGCTGTTGATCAGTTTTGCCA ^ 

GAAACAGCAAGCAGCCTATATATATCTGTCACGAGACACCCTGCCGCCCTCTTCTTTCCCGCCATTCCCTCCCTACCCTT ^ 



Fig. 34 



Lipman-Pearson Protein Alignment 

Gap Penalty: 2; Gap Length Penalty: 12 

Seql Seq2 Similarity Gap Gap Consensus 
pol CDPK ptn soybean CDPK ptn Index Number Length Length 
1>551 1>509 62,4 .1 1 464 

pol CDPK ptn VLGRPMEDVRATYSMGKELGRGQFGVTHLCTHRTSGEKLACKTIAKRKLAAREDVDDVRREVQIMHHLSG 150 

II :. :::|..|.:|:.||:||||.| 1 1 : 1 : 1 1 • 1 : 1 1 1 : 1 = 1 1 1 1 : 1 1 : 1 1 : 1 1 : 1 1 1 1 1 1 1 • 

soybean CDPK ptn VLPQRTQNIREVYEVGRKLGQGQFGTTFECTRRASGGKFACKSIPKRKLLCKEDYEDVWREIQIMHHLSE 91 

pol CDPK ptn QPNWGLRGAYEDKQSVHLVMELCAGGELFDRnARGQYTERGMELLRAIVQiyHTCHSMGyMHRpiKP 220 

"III • 1:111. :llll III MM! Mil: .:|:|:H II |:::l l::l ::l 1 1 :l 1 1 1 1 1 M I 
soybean CDPK ptn HA1WRIEGTYEDSTAVHLVMELCEGGELFDRIVQKGHYSERQAARLIKTIVEWEACHSLGVMHRDLKP 161 

pol CDPK ptn ENFLLLSKDEDAPLKATDFGLSVFFKEGELLRDIVGSAYYIAPEyLKRKYGPEADIWSVG 290 

1 1 1 1 ■ • 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : 1 II : I : II 1 : 1 1 : 1 1 1 1 1 : : 1 1 1 1 : 1 : 1 1 • 1 1 : 1 1 1 : 1 : 1 1 
soybean CDPK ptn ENFLFDTIDEDAKLMTDFGLSVFYKPGESFC^ 231 

pol CDPK ptn PPFWAENENGIFTAILRGQLDLSSEPWPHISPGAKDLV™ 360 
soybean CDPK PPFWAESEPGIFRQILLGK^ 301 

pol CDPK ptn PLDNWLDRLKQFRMQFKKMLRIIAGCLSEEEITGLKEMFM 430 
soybean CDPK ptn PLDSAVLSRLKQFSM 371 

pol CDPK ptn SDSEMEKIiMEMDADGNGLIDYDEFVTATVHMNKLDREEHLY^ 500 
soybean CDPK ptn MESEIKDLMDAADI^ 441 

pol CDPK ptn YDADKIKDIISDADSDNDGRIDYSEFVAMMRKGTAGAEPMNIKK 544 
soybean CDPK ptn -DDIhIdDMIKEiUnDGQIDYGEFM 484 



Fig. 33 



Lipman-Pearson Protein Alignment 

Gap Penalty: 2; Gap Length Penalty: 12 

Seql Seq2 Similarity . Gap Gap Consensus 

pol CDPK ptn humcama ptn Index Number Length Length 

1>551 1>150 40.3 2 2 142 



pol CDPK ptn IiSEERITGLKRMFKNIDKDNSGTITLDELKHGLAKHGPKLSDSEMEKLMEAADADGNGLIDYDEFVTATV 460 

L:EE:I:.:KE F. :DKD. .GTIT .EL : . G. : :: :E: :.:::. .DADGNG ID: EF:T . 
humcama ptn LTEEQIAEFKEAFSLFDKDGDGTITTKELGTVMRSLGQNPTEAELQDMINEVDADGNGTIDFPEFLTMMA 74 



pol CDPK ptn HMNKL-DREEHLYTAFQYFDKDNSGYITKEELEHALKEQGLYDADKIKDI-ISDADSDNDGRIDYSEFVA 528 

: M: . D.EE: : ,AF: FDKD. :GYI: .EL H. : . : G ..:.:.: :I. :AD D.DG: : :Y.EFV. 
humcama ptn RKMKDTDSEEE I REAFRVKDKDGNGY I SAAELRHVMTNLGEKLTDEEVDEMI READI DGDGQVNYEEFVQ 144 



pol CDPK ptn 
humcama ptn 



MM 530 
MM 

MM 146 



Fig. 32 



Lipman-Pearson Protein Alignment 

Gap Penalty: 2; Gap Length Penalty: 12 

Seql Seq2 Similarity Gap Gap Consensus 

pol CDPK ptn rat pk2 ptn Index Number Length Length 

1>551 1>528 3675 4 4 291 



pol CDPK ptn YSMGKELGRGQFGVTHLCTHRTSGEKLACKTIAKRKLAAREDVDDVRREVQIMHHLSGQPNWGLRGAYE 162 

Y : .ELG:G.F:V. : C..:TS.:. A K.I. . :KL:AR: : :.: RE.:I : L. :PN:V L::: . 
rat pk2 ptn YQLFEELGKGAFSWRRCVKKTSTQEYAAKIINTKKLSARDH-QKLEREARICR1LK-HPNIVRLHDSIS 81 



pol CDPK ptn DKQSVHLVMELCAGGELFDRIIARGQYTERGAAELLRAIVQIVHTCHSMGVMHRDIKPENFLLLSKDEDA 232 

:. .LV.:. :GGELF: I:AR. Y:E :A:: ::.I:: V: H : : :HRD:KPEN:LL SK .:A 
rat pk2 ptn EEGFHYLVFDLVTGGELFEDIVAREYYSEADASHCIHQILESVNHIHQHDIVHRDLKPENLLLASKCKGA 151 



pol CDPK ptn PLKATDFGLSVFFK-EGELLRDIVGSAYYIAPEVL-KRKYGPEADIWSVGVMLYIFLAGVPPFWAENENG 300 

::K :DFGL: : :..: : ::.G:: Y::PEVL::. YG .DIW: GV:LYI:L.G PPFW.E::: 
rat pk2 ptn AVKLADFGLAIEVQGEQQAWFGFAGTPGYLSPEVLRKDPYGKPVDIWACGVILYILLVGYPPFWDEDQHK 221 



pol CDPK ptn IFTAILRGQLDLSSEPWPHISPGAKDLVKKMLNINPKERLTAFQVLNHPWIKEDGDAPDTPLDNWLDRL 370 

:: .1 G. D::S W : :P.AK:L: : :ML.INP .R:TA Q.L.-HPW: : :..:. : .:: L 
rat pk2 ptn LYQQIKAGAYDFPSPEWDTVTPEAKNLINQMLTINPAKRITADQALKHPWVCQRSTVASMMHRQETVECL 291 



pol CDPK ptn KQFRAMNQFKKAALRI I 387 

::F.A..::K hi.: 
rat pk2 ptn RKFNARRKLKGAILTTM 308 



